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HI1A-A2.I BINDING PEPTIDES AND THEIR USEg 

The presenc application is a conninuacion in Darr of 
USSN 08/159,184, which is a concinuaticn in pare of " USSN 
08/073,205, which is a conninuacion in pare of USSN 
08/027,146, all of which are incorporaced herein by reference. 

BACKGROUND OF. THE INVENTION 
The present invention relates to compositions and 
methods for preventing, treating or diagnosing a number of 
pathological states such as viral diseases and cancers. In 
particular, it provides novel peptides capable of binding 
selected major histocompatibility complex (MHC) molecules and 
inducing an immune response . 

MHC molecules are classified as either Class I or 
Class II molecules . Class II MHC molecules are expressed 
primarily on cells involved in initiating and sustaining 
immune responses, such as T lymphocytes, B lymphocytes, 
macrophages, etc. Class II MHC molecules are recognized by 
helper T lymphocytes and induce proliferation of helper T 
lymphocytes and amplification of the immune response to the 
particular immunogenic peptide that is displayed. Class I MHC 
molecules are expressed on almost all nucleated cells and are 
recognized by cytotoxic T lymphocytes (CTLs) , which then 
destroy the antigen -bearing cells. CTLs are particularly 
important in tumor rejection and in fighting viral infections. 

The CTL recognizes the antigen in the form of a 
peptide fragment bound to the MHC class I molecules rather 
than the intact foreign antigen itself. The antigen must 
normally be endogenously synthesized by the cell, and a 
portion of the protein antigen is degraded into small , peptide 
fragments in the cytoplasm. Some of these small peptides 
translocate into a pre-Golgi compartment and interact with 
class I heavy chains to facilitate proper folding and 
association with the subunit 02 microglobulin. The 
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pep-:Lde-MHC class I con-.pisx is -hen rcured co che csil surface 
for expression and pocenr il reccgnirion by specific CTLs. 

Invescigations of the cryscai scmccure of the human 
MHC class I molecule, HLA-A2.1, indicate that a peptide 
binding groove is created by the folding of the al and a2 
domains of the class I heavy chain (Bjorionan et al.. , Naturg 
329:506 { 1987). In these investigations, however, the 
identity of peptides bound to the groove was not determined. 

Buus et al., Sciencg 242:1065 (1988) first described 
a method for acid elution of bound peptides from MHC. 
Subsequently, Rammensee and his coworkers (Falk et al., Nature 
351:290 (1991) have developed an approach to characterize 
naturally processed peptides bound to class I molecules. 
Other investigators have successfully achieved direct amino 
acid sequencing of the more abundant peptides in various HPLC 
fractions by conventional automated sequencing of peptides 
eluted from class I molecules of the B type (Jardetzky, et 
al., Naturg 353:326 (1991) and of the A2.1 type by mass 
spectrometry (Hunt, et al., Ssigjisg 225 : 1261 (1992). A review 
of the characterization of naturally processed peptides in MHC 
Class I has been presented by Rdtzschke and Falk (Rotzschke 
and Falk, Immunol . Today 12:447 (1991) 

Sette et al., Proc. Wa nl . Aeari. Sc± . nga 86:3296 
(1989) showed that MHC allele specific motifs could be used to 
predict MHC binding capacity. Schaeffer et al . , Proc. Natl, 
Ag^d. Sgi. USA 86:4649 (1989) showed that MHC binding was 
related to immunogenicity . Several authors (De Bruijn et al.. 
Bur. J. Immunol ., 21:2963-2970 (1991); Pamer et al., 991 
Sa£li£S 353:852-955 (1991)) have provided preliminary evidence 
that class I binding motifs can be applied to the 
identification of potential immunogenic peptides in animal 
models . Class I motifs specific for a number of human alleles 
of a given class I isotype have yet to be described. It is 
desirable that the combined frequencies of these different 
alleles should be high enough to cover a large fraction or 
perhaps the majority of the human outbred population. 

Despite the developments in the art, the prior art 
has yet to provide a useful human peptide -based vaccine or 
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cnerapeuc-c agenc based on chis work. The presenc invencicr. 
prcvides rhese and ochsr advanrages . 

SU>5MARY OF THE INVENTION 
The present invention provides compositions 
comprising iinmunogenic peptides having binding motifs for HIiA- 
A2.1 molecules. The immunogenic peptides, which bind to the 
appropriate MHC allele, are preferably 9 to 10 residues in 
length and comprise conserved residues at certain positions 
such as positions 2 and 9. Moreover, the peptides do not 
comprise negative binding residues as defined herein at other 
positions such as positions 1, 3. 6 and/or 7 in the case of 
peptides 9 amino acids in length and positions i, 3, 4, 5, 7 
8 and/or 9 in the case of peptides 10 amino acids in length. 
The present invention defines positions within a motif 
enabling the selection of peptides which will bind efficiently 
to HLA A2.1, 

Epitopes on a number of immunogenic target proteins 
can be identified using the peptides of the invention. 
Examples of suitable antigens include prostate cancer specific 
antigen (PSA) , hepatitis B core and surface amtigens (HBVc, 
HBVs) hepatitis C antigens, Epstein-Barr virus antigens, human 
immunodeficiency type-l virus (HIVl) and papilloma virus 
antigens. The peptides are thus useful in pharmaceutical 
compositions for both ii^ vivo and ex vivo therapeutic and 
diagnostic applications. 



Definitions 

The term "peptide" is used interchangeably with 
"oligopeptide" in the present specification to designate a 
series of residues, typically L-amino acids, connected one to 
the other typically by peptide bonds between the alpha-amino 
and carbonyl groups of adjacent amino acids. The 
oligopeptides of the invention are less than about 15 residues 
in length and usually consist of between about 8 and about 11 
residues, preferably 9 or 10 residues. 

An "immunogenic peptide" is a peptide which 
comprises an allele -specific motif such that the peptide will 
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bind an MHC .-noiecuie and induce a CTL response. Immunogenic 
pepcides of the invention are capable of binding co an 
appropriacs KLA-A2.1 molecule and inducing a cyrocoxic T ceil 
response againsc the antigen from which the immunogenic 
peptide is derived. 

Immunogenic peptides are conveniently identified 
using the algorithms of the invention. The algorithms are 
mathematical procedures that produce a score which enables the 
selection of immunogenic peptides. Typically one uses the 
algorithmic score with a "binding threshold" to enable 
selection of peptides that have a high probability of binding 
at a certain affinity and will in turn be immunogenic. The 
algorithm is based upon either the effects on MHC binding of a 
particular amino acid at a particular position of a peptide or 
the effects on binding of a particular substitution in a motif 
containing peptide. 

A "conserved residue" is an amino acid which occurs 
in a significantly higher frequency than would be expected by 
random distribution at a particular position in a peptide. 
Typically a conserved residue is one where the MHC structure 
may provide a contact point with the immunogenic peptide. One 
to three, preferably two, conserved residues within a peptide 
of defined length defines a motif for an immunogenic peptide. 
These residues are typically in close contact with the peptide 
binding groove, with their side chains buried in specific 
pockets of the groove itself. Typically, an immunogenic 
peptide will comprise up to three conserved residues, more 
usually two conserved residues. 

As used herein, "negative binding residues" are 
amino acids which if present at certain positions (for 
example, positions 1, 3 and/or 7 of a 9-mer) will result in a 
peptide being a nonbinder or poor binder and in turn fail to 
be immunogenic i.e. induce a CTL response. 

The term "motif" refers to the pattern of residues 
in a peptide of defined length, usually about 8 to about 11 
amino acids, which is recognized by a particular MHC allele. 
The peptide motifs are typically different for each human MHC 
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allele and differ in zhe paccsrn cf zhe highly conserved 
residues and negative residues . 

The binding mocif for an allele can be defined wich 
increasing degrees of precision. In one case, all of rhe 
conserved residues are present in che correct: positions in a 
peptide and there are no negative residues in positions 1,3 
and/ or 7. 

The phrases "isolated" or "biologically pure" refer 
to material which is substantially or essentially free from 
components which normally accompany it as found in its native 
state. Thus, the peptides of this invention do not contain 
materials normally associated with their in situ environment, 
e.g., MHC I molecules on antigen presenting cells. Even where 
a protein has been isolated to a homogenous or dominant band, 
there are trace contaminants in the range of 5-10% of native 
protein which co-purify with the desired protein. Isolated 
peptides of this invention do not contain such endogenous co- 
purified protein. 

The term "residue" refers to an amino acid or amino 
acid mimetic incorporated in an oligopeptide by an amide bond 
or amide bond mimetic. 



BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 is a flow diagram of an HLA-A purification 

scheme. 

Fig. 2 shows a scattergram of the log of relative 
binding plotted against the "Grouped Ratio" algorithm for 9 
mer peptides. 

Fig. 3 shows a scattergram of the log of relative 
binding plotted against the average "Log of Binding" algorithm 
score for 9 mer peptides. 

Figs. 4 and 5 show scattergrams of a set of 10 -mer 
peptides containing preferred residues in positions 2 and 10 
as scored, by the "Grouped. Ratio" and "Log of Binding" 
algorithms . 
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DESCRIPTION OF THE PREFERRED EMBODIMENTS 
The present invention relates to the decenriination 
of allele- specific peptide motifs for human Class I MHC 
(someti.-nes referred to as KLA) allele suDtypes, in particular, 
peptide motifs recognized by HLA-A2.1 alleles. These motifs 
are then used to define T cell epitopes from any desired 
antigen, particularly those associated with human viral 
diseases, cancers or autoiummune diseases, for which the amino 
acid sequence of the potential antigen or autoantigen taraets 
is known. 

Epitopes on a number of potential target proteins 
can be identified in this manner. Examples of suitable 
antigens include prostate specific antigen (PSA) , hepatitis B 
core and surface antigens (K3Vc, HSVs) hepatitis C antigens, 
Epscein-Barr virus antigens, melanoma antigens (e.g., MAGE-l) , 
human immunodeficiency virus (HIV) antigens and human 
papilloma virus (HPV) antigens. 

The peptides of the invention may also be employed 
to relieve the symptoms of, treat or prevent the occurrence or 
reoccurrence of autoimuune diseases. Such diseases include, 
for example, multiple sclerosis (MS), rheumatoid arthritis 
(RA) , Sjogren syndrome, scleroderma, polymyositis, 
dermatomyositis, systemic lupus erythematosus, juvenile 
rheumatoid arthritis, ankylosing spondylitis, myasthenia 
jravis (MG) , bullous pemphigoid (antibodies to basement 
membrane at dermal. epidermal junction) , pemphigus (antibodies 
to mucopolysaccharide protein complex or intracellular cement 
substance) , . glomerulonephritis (antibodies to glomerular 
basement membrane), Goodpasture's syndrome, autoimmune 
hemolytic anemia (antibodies to erythrocytes), Hashimoto's 
disease (antibodies to thyroid) , pernicious anemia (antibodies 
to intrinsic factor) , idiopathic thrombocytopenic purpura 
(antibodies to platelets). Grave's disease, and Addison's 
disease (antibodies to thyroglobulin) , and the like. 

The autoantigens associated with a number of these 
diseases have been identified. For example, in experimentally 
induced autoimmune diseases, antigens involved in pathogenesis 
have been characterized: in arthritis in rat and mouse. 
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nacive cype-II collagen is iaencified in collagen- induced 
archricis , and mycoiDaccerial heac shock prorein in adjuvant 
arrhrinis ; thyroglobulin has been idenrified in experiniencal 
allergic chyroiditis (EAT) in mouse; acetyl choline receptor 
5 (AChR) in experimental allergic myasthenia gravis (EAMG) ; and 
myelin basic protein (MBP) and proteolipid protein (PLP) in 
experimental allergic encephalomyelitis (SAE) in mouse and 
rat. In addition, target antigens have been identified in 
humans: type- II collagen in human rheumatoid arthritis; and 

10 acetyl choline receptor in myasthenia gravis. 

Peptides comprising the epitopes from these antigens 
are synthesized and then tested for their ability to bind to 
the appropriate MHC molecules in assays using, for example, 
purified class I molecules and radioiodonated peptides and/or 
.15 cells expressing empty class I molecules by, for instance, 

immunof lucres cent staining and flow microfluorometry, peptide - 
dependent class I assembly assays, and inhibition of CTL 
recognition by peptide competition- Those peptides that bind 
to the class I molecule are further evaluated for their 

20 ability to serve as targets for CTLs derived from infected or 
immunized individuals, as well as for their capacity to induce 
primary 4n vitro or in vivo CTL responses that can give rise 
to CTL populations capable of reacting with virally infected 
target cells or tumor cells as potential therapeutic agents. 

25 The MHC class I antigens are encoded by the HLA-A, 

B, and C loci. HLA-A and B antigens are expressed at the cell 
surface at approximately equal densities, whereas the 
expression of HLA-C is significantly lower (perhaps as much as 
10 -fold lower) , Each of these loci have a number of alleles. 

3 0 The peptide binding motifs of the invention are relatively 
specific for each allelic subtype . 

For peptide-based vaccines, the peptides of the 
present invention preferably comprise a motif recognized by an 
MHC I molecule having a wide distribution in the human 

3 5 population. Since the MHC alleles occur at different 

frequencies within different ethnic groups and races, the 
choice of target MHC allele may depend upon the target 
population. Table 1 shows the frequency of various alleles at 
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Che HLA-A locus produces among different races. For inscance 
the niajority of the Caucasoid population can be covered by 
peptides which bind to four HLA-A allele subtypes, 
specifically HLA-A2.1, Al, A3. 2, andA24.1. Similarly, the 
majority of the Asian population is encc2:ipassed with the 
addition of peptides binding to a fifth allele .tILA-Ali.2. 
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TABLE 1 

A Allele /SubCVTSe N ( € ? ' ' A ( 5 4 '■ C :502) 



Al 10.1(7) 

A2.1 11.5(8) 

A2.2 10.1(7) 

A2.3 1.4(1) 
A2 . 4 
A2.5 

A3 . 1 1.4(1) 

A3 .2 5.7(4) 

All.l 0 

All. 2 5.7(4) 

All . 3 0 

A23 4.3 (3) 

A24 2.9(2) 
A24 .2 
A24.3 

A25 1.4(1) 

A26.1 4.3(3) 

A26.2 7,2(5) 
A26V 

A28.1 10.1(7) 

A2a.2 1.4(1) 

A29.1 1.4(1) 

A29.2 10.1(7) 

A30.1 8.6(6) 

A30.2 1.4(1) 

A30.3 7.2 (5) 

A31 4.3(3) 

A32 2.8(2) 

AW33.1 8.6(6) 

Aw33.2 2.8(2) 

AW34.1 1.4(1) 

AW34.2 14.5(10) 

Aw36 5.9(4) 



1_ , 


5 


(1) 


27.4 (138) 


37 . 




{20 ) 


39 . 8 ( 199 } 


0 






3 .3 (17) 


- 






0.8(4) 


- 

0 






- 

0.2 (0) 






(3 ) 


21.5 (108) 


5- 


- 


(3) 


0 


31 - 


4 


(17) 


a .7 (44) 


3 - 


7 


(2) 


0 


- 






3.9 (20) 


27 . 
• 


7 


(15) 


15.3 (77) 
- 


- 
- 






- 

6.9 (35) 


9 . 


2 


(5) 


5.9 (30) 


- 






1.0(5) 


3, 


7 


(2) 












- 






7.5 (38) 








1.4(7) 


1. 


8 


(1) 


5.3 (27) 








4.9(25) 








0.2(1) 








3.9 (20) 


7. 


4 


(4) 


6.9 (35) 








7.1(36) 








2.5 (13) 


16. 


6 


(9) 


1.2(6) 








0.8(4) 



Table compiled from B. DuPont, Immunobioloav of HLA . Vol. 
I, Hiscocompatibility Testing 1987, Springer- Verlag , New York 
1989. 

N - negroid; A = Asian; C = caucasoid. Numbers in 
parenthesis represent the number of individuals included in 
the analysis. 
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The nomenclature used to descr:Li;e peptide compounds 
follows Che conventional practice wherein the amino croup is 
presented to the left (the N- terminus) and the carcoxyi group 
CO the right (the C- terminus) of each amino acid residue. In 
the formulae representing selected specific embodiments of the 
present invention, the amino- and carcoxyi - terminal crouos, 
although not specifically shown, are in the form they would 
assume at physiologic pH values, unless otherwise specified. 
In the amino acid structure formulae, each residue is 
generally represented by standard three letter or single 
letter designations. The L-form of an amino acid residue is 
represented by a capital single letter or a capital first 
letter cf a three- letter s>Tnbol, and the D-form for those 
aminr acids having D- forms is represented by a lower case 
single letter or a lower case three letter symbol. Glycine 
has no asymmetric carbon atom and is simply referred to as 
"Gly" or G. 

The procedures used to identify peptides of the 
present invention generally follow the methods disclosed in 
Falk et al., mLHSS. 351:290 (1991), which is incorporated 
herein by reference. Briefly, the methods involve large-scale 
isolation of MHC class I molecules, typically by 
immunoprecipitation or affinity chromatography, from the 
appropriate cell or cell line. Examples of other methods for 
isolation of the desired MHC molecule equally well known to 
the artisan include ion exchange chromatography, lectin 
chromatography, size exclusion, high performance ligand 
chromatography, and a combination of all of the above 
techniques . 

In the typical case, immunoprecipitation is used to 
isolate the desired allele. A number of protocols can be 
used, depending upon the specificity of the antibodies used. 
For example, allele- specif ic mAb reagents can be used for the 
affinity purification of the HLA-a, HLA-Bj^, and HLA-C 
molecules. Several mAb reagents for the isolation of HLA-A 
molecules are available. The monoclonal BB7.2 is suitable for 
isolating HLA-A2 molecules. Affinity columns prepared with 
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chese mAbs using scandard cechniques are . successf uliv used zz 
purify Che respective KIA-A allele produces. 

In addition co allele- specif ic mAbs , broadlv 
reactive anti-HLA-A, 3, C mAbs, such as W6/32 and E9.12.1, and 
5 one anti-HLA-B, C mAb, 31.23.2, could be used in alternative 
affinity purification protocols as described in the example 
section below. 

The peptides bound to the peptide binding groove of 
the isolated MHC molecules are eluted typically using acid 

10 treatment. Peptides can also be dissociated from class I 

molecules by a variety of standard denaturing means, such as 
heat, pH, detergents, salts, chaotropic agents, or a 
combination thereof. 

Peptide fractions are further separated from the MHC 

15 molecules by reversed -phase high performance liquid 

chromatography (HPLC) and sequenced. Peptides can be 
separated by a variety of other standard means well known to 
the artisaji, including filtration, ultrafiltration, 
electrophoresis, size chromatography, precipitation with 

20 specific antibodies, ion exchange chromatography, 
isoelectrof ocusing, and the like. 

Sequencing of the isolated peptides can be performed 
according to standard techniques such as Edman degradation 
(Hunkapiller, M.W. , gt ^3,,, Methods E nzvmol , 91. 399 [1983])," 

25 Other methods suitaJsle for sequencing include mass 

spectrometry sequencing of individual peptides as previously 
described (Hunt, et al., Science 225:1261 (1992), which is 
incorporated herein by reference) . Amino acid sequencing of 
bulk heterogenous peptides ( e.g. . pooled HPLC fractions) from 

3 0 different class I molecules typically reveals a characteristic 
sequence motif for each class I allele. 

Definition of motifs specific for different class I 
alleles allows the identification of potential peptide 
epitopes from aji antigenic protein whose amino acid sequence 

35 is known. Typically, identification of potential peptide 

epitopes is initially carried out using a computer to scan the 
amino acid sequence of a desired antigen for the presence of 
motifs. The epitopic sequences are then synthesized. The 



wo 94/20127 



PCTAJS94/023S3 



12 



capacity co bind MHC Class molecules is measured in a variscv 
of differenr ways. One means is a Class I molecule binding ' 
assay as described in Example 4, below. Other alternatives 
described in the literature include inhibition of antiaen 
presentation (Sette, et al., J. Immunol . 141:3893 (1991), in 
Yitrc assembly assays (Townsend, et al., Cell €2:285 (1990) , 
and FAGS based assays using mutated ells, such as RMA.S 
(Melief, et al., Eur. J. Immunol . 21:29S3 (1991)). 

Next, peptides that test positive in the MHC class I 
bl.-.-.ng assay are assayed for the ability of the peptides to 
induce specific CTL responses in vitro . For instance, 
Antigen- presenting cells that have been incobated with a 
peptide can be assayed for the ability to induce CTL responses 
in responder cell populations. Antigen- presenting cells can 
normal cells such as peripheral blood mononuclear cells or 
--itic cells (Inaba, et al., J. Exp. Med. 166:182 (1987); 
aoos, Eur. J. Immunol . 18:219 (1988]). 

Alternatively, mutant raammalian cell lines that -v 
deficient in their ability to load class I molecules with 
internally processed peptides, such as the mouse cell lines 
RMA-S (KSrre. et al.. Nat^yg, 319:675 (1986); Ljunggren, et 
al. , Eyy. J, imrnimnl . 21:2963-2970 (1991)), and the human 
somatic T cell hybrid, T-2 (Cerundolo, et al. , Nature 345:449- 
452 (1990)) and which have been transfected with the 
appropriate human class I genes are conveniently used, when 
peptide is added to them, to test for the capacity of the 
peptide to induce in vi.trn primary CTL responses. Other 
eukar^'ot-c cell lines which could be used include various 
insect cell lines such as mosquito larvae (ATCC cell lines CCL 
125, 126, 1660, 1591, 6585, 6586), silkworm (ATTC CRL 8851), 
armywonn (ATCC CRL 1711) , moth (ATCC CCL 80) and Drosophila' 
cell lines such as a Schneider cell line (see Schneider 
Embrvol. Bxn. Mn-rphni -jn . ^t^r^ . ^f^r^ [1927]). 

Peripheral blood lymphocytes are conveniently 
isolated following simple venipuncture or leuJcapheresis of 
normal donors or patients and used as the responder cell 
sources of CTL precursors. In one embodiment, the appropriate 
antigen -presenting cells are incubated with 10-100 fxM of 
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.peptide in serum- free media fcr 4 hours under apprcpriace 
culture conditions. The peptide- loaded antigen-presenting 
cells are then incubated with the responder cell populations 
in vitro fcr 7 to 10 days under optimized culture conditions. 
5 Positive CTL activation can be determined by assaying the 
cultures for the presence of CTLs that kill radiolabeled 
target cells, both specific peptide-pulsed targets as well as 
target cells expressing endogenously processed form of the 
relevant virus or tumor antigen from which the peptide 

10 sequence was derived . 

Specificity and MHC restriction of the CTL is 
determined by testing against different peptide target cells" 
expressing appropriate or inappropriate human MHC class I, 
The peptides that test positive in the MHC binding assays and 

15 give rise to specific CTL responses are referred to herein as 
immunogenic peptides. 

The immunogenic peptides can be prepared 
synthetically, or by recombinant DNA technology or from 
natural sources such as whole viruses or tumors. Although the 

20 peptide will preferably be substantially free of other 

naturally occurring host cell proteins and fragments thereof,: 
in some embodiments the peptides can be synthetically 
conjugated to native fragments or particles. 

The polypeptides or peptides can be a variety of 

25 lengths, either in' their neutral (uncharged) forms or in forms 
which are salts, and either free of modifications such as 
glycosylation, side chain oxidation, or phosphorylation or 
containing these modifications, subject to the condition that 
the modification not destroy the biological activity of the 

30 polypeptides as herein described. 

Desirably, the peptide will be as small as possible 
while still maintaining substantially all of the biological 
activity of the large peptide. When possible, it may be 
desirable to optimize peptides of the invention to a length of 

35 about 8 to about 10 amino acid residues, commensurate in size 
with endogenously processed viral peptides or tumor cell 
peptides that are bound to MHC class I molecules on the cell 
surface. 
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Peptides having che desired accivicy may be modified 
as necessary to provide certain desired attributes, e.g., 
improved pharmacological characteristics, while increasing or 
at least retaining substantially all of the biological 
activity of the unmodified peptide to bind the desired MHC 
molecule and activate the appropriate T cell. For ■ instance, 
the peptides may be subject to various changes, such as 
substitutions, either conservative or non- conservative, where 
such changes might provide for certain advantages in their 
use, such as improved MHC binding. By conservative 
substitutions is meant replacing an amino acid residue with 
another which is biologically and/or chemically similar, e.g., 
one hydrophobic residue for another, or one pclar residue for 
another. The substitutions include combinations such as Gly, 
Ala; Val, He, Leu, Met; Asp, Glu; Asn, Gin; Ser, Thr; Lys, 
Arg; and Phe, Tyr. The effect of single amino acid 
substitutions may also be probed using D-amino acids. Such 
modifications may be made using well known peptide synthesis 
procedures, as described in e.g., Merrifield, Science 232:341- 
347 (1986), Barany and Merrifield, T--e Peptides . Gross and 
Meienhofer, eds. (N.Y., Academic Press), pp. 1-284 (1979); and 
Stewart and Young, Solid Pha se Peptide Svnthgc^-j ^ (Rockford, 
111., Pierce), 2d Ed. (1984), incorporated by reference 
herein. 

The peptides can also be modified by extending or 
decreasing the compound's amino acid sequence, e.g., by the 
addition or deletion of amino acids. The peptides or analogs 
of the invention can also be modified by altering the order or 
composition of certain residues, it being readily appreciated 
that certain amino acid residues essential for biological 
activity, e.g., those at critical contact' sites or conserved 
residues, may generally not be altered without an adverse 
effect on biological activity. The non-critical amino acids 
need not be limited to those naturally occurring in proteins, 
such as L-Qf-amino acids, or their D- isomers, but may include 
non-natural amino acids as well, such as )3-y.fi-amino acids, as 
well as many derivatives of L-a-amino acids. 
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Typically, a series of peptides w±zh single amino 
acid siibsricutions are employed to decezrrr.ine che effect of 
electrostatic charge, hydrophobicity , etc. on binding. For 
instance, a series of positively charged {e.g., Lys or Arg) or 
5 negatively charged (e.g. , Glu) amino acid substitutions are 
made along the length of the peptide revealing different 
patterns of sensitivity towards various MHC molecules and T 
cell receptors. In addition, multiple substitutions using 
small, relatively neutral moieties such as Ala, Gly, Pro, or 

10 similar residues may be employed. The substitutions may be 
homo- oligomers or hetero - oligomers . The number and types of 
residues which are substituted or added depend on the spacing 
necessary between essential contact points and certain 
functional attributes which are sought (e.g., hydrophobicity 

15 versus hydrophilicity) . Increased binding affinity for an MHC 
molecule or T cell receptor may also be achieved by such 
sxibstitutions , compared to the affinity of the parent peptide. 
In any event, such sxibsti tut ions should employ amino acid 
residues or other molecular fragments chosen to avoid, for 

20 example, steric and charge interference which might disrupt 
binding , 

Amino acid substitutions are . typically of single 
residues. Substitutions, deletions, insertions or any 
combination thereof may be combined to arrive at a final 
25 peptide. Substitutional variants are those in which at least 
one residue of a peptide has been removed ajid a different 
residue inserted in its place. Such substitutions generally 
are made in accordance with the following Table 2 when it is 
desired to finely modulate the characteristics of the peptide. 
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TABLE 2 

Qricrinal Residue Exemplary Substirut^ ^ on 



Ala 




Ser 


Arg 


Lys, 


His 


Asn 




Gin 


Asp 




Glu 


Cys 




Ser 


Gin 




Asn 


Glu 




Asp 


Gly 




Pro 


His 


Lys; 


Arg 


lie 


Leu; 


Val 


Leu 


Ile; 


Val 


Lys 


Arg; 


His 


Met 


Leu; 


He 


Phe 


Tyr; 


Trp 


jser 


Thr 


Thr 


Ser 


Trp 


Tyr; 


Phe 


T/r 


Trp; 


Phe 


Val 


He; 


Leu 
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Subscanciai changes in function le.c., affinizv for 
MHC molecules or T cell receptors) are made by selecrina 
siibscitunions chat are less conservative than those in Table 
2, i.e., selecting residues that differ r;ore significantly in 
5 their effect on maintaining (a) the structure of the peptide 
backbone in the area of the substitution, for example as a 
sheet or helical conformation, (b) the charge or 
hydrophobicity of the molecule at the target site or (c) the 
bulk of the side chain. The substitutions which in general 

10 are expected to produce the greatest changes in peptide 

properties will be those in which (a) hydrophilic residue, 
e.g. seryl, is substituted for (or by) a hydrophobic residue, 
e.g. leucyl, isoleucyl, phenylalanyl , valyl or alanyl ; (b) a 
residue having an electropositive side chain, e.g., lysl, 

15 arginyl, or histidyl, is substituted for (or by) an 

electronegative residue, e.g. glutamyl or aspartyl; or (c) a 
residue having a bulky side chain, e.g. phenylalanine, is 
sxibstituted for (or by) one not having a side chain, e.g., 
glycine. 

20 The peptides may also comprise isosteres of two or 

more residues in the immunogenic peptide. An isostere as 
defined here is a sequence of two or more residues that can be 
substituted for a second sequence because the steric 
conformation of the first sequence fits a binding site 

25 specific for the second sequence. The term specifically 

includes peptide backbone modifications well known to those 
skilled in the art. Such modifications include modifications 
of the amide nitrogen, the a- carbon, amide carbonyl, complete 
replacement of the amide bond, extensions, deletions or 

30 backbone crosslinks. See, generally . Spatola, Chemistry and 
Biochemistry of Amino Acids, peptides and Proteins . Vol. VII 
(Weinstein ed. , 1983) . 

Modifications of peptides with various amino acid 
mimetics or unnatural amino acids are particularly useful in 

35 increasing the stability of the peptide in vivo . Stability 

can be assayed in a number of ways. For instance, peptidases 
and various biological media, such as human plasma and serum, 
have been used to test stability. See , e.g. . Verhoef et al., 
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Sur. J. Drrg Mgtab. Ph anracskin . 11:291-3 02 (19 86) . Half 
life of the pepcides of the present inver.-ion is conveniently 
detennined using a 25% human serum (v/v) assay. The trotocci 
is generally as follows. Pooled hianan senai (Type AB, 
-non- heat inactivated) is delipidated by centrifugation before 
use. The serum is then diluted to 25% with RPMI tissue 
culture media and used to test peptide stability. At 
predetermined time intervals a small amount of reaction 
solution is removed and added to either S% aqueous 
trichloracetic acid or echanol. The cloudy reaction sample is 
cooled (4°C) for 15 minutes and then spun to pellet the 
precipitated serum proteins. The presence of the peptides is 
then determined by reversed -phase HPLC using 
stability- specif ic chromatography conditions. 

The peptides of the present invention or analogs 
thereof which have CTL stimulating activity may be modified to 
provide desired attributes other than improved serum half 
-ife. For instance, the ability of the peptides to induce CTL 
activity can be enhanced by linkage to a sequence which 
contains at least one epitope that is capable of inducing a T 
helper cell response. 

In some embodiments, the T helper peptide is one that 
is recognized by T helper cells in the majority of the 
population. This can be accomplished by selecting amino acid 
sequences that bind to many, most, or all of the MHC class II 
molecules. These are known as "loosely MHC- restricted" T 
helper "^quences. Examples of amino acid sequences that are 
Ivv.:..-. -iC- restricted include sequences from antigens such as 
Tetanus toxin at positions 830-843 (QYIKANSKFIGITE) , 
Plasmodium falciparum CS protein at positions 378-398 
(DIEKKIAKMEKASSVFNWNS) V and Streptococcus lakD protein at 
positions 1-16 ( YGAVDSILGGVATYGAA) . 

Alternatively, it is possible to prepare synthetic 
peptides capable of stimulating T helper lymphocytes, in a 
loosely MHC- restricted fashion, using amino acid sequences not 
found in nature. These synthetic compounds called 
Pan-DR-binding epitope (PADRE) are designed on the basis of 
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cheir binding accivicy zo mosc, HLA-DR (human MHC class ZD 
molecules (see, copending applicacion USSN 08/121,101). 

Particularly preferred iinmunogenic peptides /T helper 
conjugates are linked by a spacer molecule. The spacer is 
5 typically comprised of relatively small, neutral molecules, 
such as amino acids or amino acid mimetics, which are 
substantially uncharged under physiological conditions. The 
spacers are typically selected from, e.g. , Ala, Gly, or other 
neutral spacers of nonpolar amino acids or neutral polar amino 

10 acids. It will be understood thiat the optionally present 

spacer need not comprise the same residues and thus may be a 
hetero- or homo -oligomer. When present, the spacer will 
usually be at least one or two residues, more usually three to 
six residues. Alternatively, the CTL peptide may be linked to 

15 the T helper peptide without a spacer. 

The immunogenic peptide may be linked to the T helper 
peptide either directly or via a spacer either at the amino or 
carboxy terminus of the CTL peptide. The amino terminus of 
either the immunogenic peptide or the T helper peptide may be 

20 acylated. Exemplary T helper peptides include tetanus toxoid 
830-843, influenza 3 07-319, malaria circumsporozoite 382-398- 
and 378-3 89. 

In some embodiments it may be desirable to include in 
the pharmaceutical compositions of the invention at least one 

25 component which primes CTL. Lipids have been identified as 

agents capable of priming CTL in vivo against viral antigens. 
For example, palmitic acid residues can be attached to the 
alpha and epsilon amino groups of a Lys residue and then 
linked, e.g., via one or more linking residues such as Gly, 

30 Gly-Gly- , Ser, Ser-Ser, or the like, to an immunogenic 

peptide. The lipidated peptide can then be injected directly 
in a micellar form, incorporated into a liposome or emulsified 
in an adjuvant, e.g., incomplete Freund's adjuvant. In a 
preferred embodiment a particularly ef f ective - immunogen 

35 comprises palmitic acid attached to alpha and epsilon amino 
groups of Lys, which is attached via linkage, e.g., Ser-Ser, 
to the amino terminus of the immunogenic peptide. 
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As another example of lipid priniing cf C7L responses, 
E- "oli lipoproceins , such as 

tripalmicoyl-S-glycerylcysceinlyseryi -serine (P3CSS) can be 
used to prime virus specific CTL when covalently attached to 
an appropriate peptide. See, Deres et al . , Nature 342:561-564 
(19 89), incorporated herein fay reference. Peptides of the 
invention can be coupled to P3CSS, for example, and the 
lipopeptide administered to an individual to specifically 
prime a CTL response to the target antigen. Further, as the 
induction of neutralizing antibodies can also be primed with 
P3CSS conjugated to a peptide which displays an appropriate 
epitope, the two compositions can be combined to more 
effectively elicit both humoral and cell -mediated responses to 
infection. 

In addition, additional amino acids can be added to 
the termini of a peptide to provide for ease of linking 
peptides one to another, for coupling to a carrier support, or 
larger peptide, for modifying the physical or chemical 
properties of the peptide or oligopeptide, or the like. Amino 
acids such as tyrosine, cysteine, lysine, glutamic or aspartic 
acid, or the like, can be introduced at the C- or N- terminus 
of the peptide or oligopeptide. Modification at the C 
terminus in some cases may alter binding characteristics of 
the peptide. In addition, the peptide or oligopeptide 
sequences can differ from the natural sequence by being 
modified by terminal-NHj acylation, e.g., by alkanoyl (Cj^-C^q) 
or thioglycolyl acetylation. terminal - carboxyl amidation, 
e.g., ammonia, methylamine, etc. In some instances these 
modifications may provide sites for linking to a support or 
other molecule. 

The peptides of the invention can be prepared in a 
wide variety of ways. Because of their relatively short size, 
the peptides can be synthesized in solution or on a solid 
support in accordance with conventional techniques. Various 
automatic synthesizers are commercially available and can be 
used in accordance with known protocols. See, for example, 
Stewart and Young. Soljd Phase Peor-iriP ^yr^^^^c,^ 2d. ed. , 
Pierce Chemical Co. (1984), supra . 
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Alcemaniveiy , rscombinanc DNA cecnnciogy may be 
employed wherein a nucleocide sequence which encodes an 
immunogenic peptide of interest is inserted into an expression 
vector, transformed or transfected into an appropriate host 
5 cell and cultivated under conditions suitable for expression. 
These procedures are generally known in the art, as described 
generally in Sambrook et al . , Molecular Cloning, A LaborarnT-v 
Manual, Cold Spring Harbor Press, Cold Spring Harbor, New York 
(1982), which is incorporated herein by reference. Thus, 
10 fusion proteins which comprise one or more peptide sequences 

of the invention can be used to present the appropriate T cell 
epitope . 

As the coding sequence for peptides of the length 
contemplated herein can be synthesized by chemical technioues, 

15 for example, the phosphotriester method of Matteucci et al., 
J. Am, Chem. Soc . 103:3185 (1981), modification can be made 
simply by substituting the appropriate base(s) for those 
encoding the native peptide sequence. The coding sequence can 
then be provided with appropriate linkers and ligated into 

20 expression vectors commonly available in the art, and the 
vectors used to transform suitable hosts to produce the 
desired fusion protein. A number of such vectors. and suitable 
host systems are now available. For expression of the fusion- 
proteins, the coding sequence will be provided with operably 

25 linked start and stop codons, promoter and terminator regions 
and usually a replication system to provide an expression 
vector for expression in the desired cellular host. For 
example, promoter sequences compatible with bacterial hosts 
are provided in plasmids containing convenient restriction 

30 sites for insertion of the desired coding sequence. The 

resulting expression veccors are transformed into suitable 
bacterial hosts. Of course, yeast or mammalian cell hosts may 
also be used, employing suitable vectors and control 
sequences . 

35 The peptides of the present invention and 

pharmaceutical and vaccine compositions thereof are useful for 
administration to mammals, particularly humans, to treat 
and/or prevent viral infection and cancer. Examples of 
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diseases which can bs creaced using the immunogenic peptides 
of Che invention include prostate cancer, hepatitis B, 
hepatitis C, AIDS, renal carcinoma, cervical carcinoma, 
lymphoma, CMV and condlyloma acuminatum. 

For pharmaceutical compositions, the immunogenic 
peptides cf the invention are administered to an individual 
already suffering from cancer or infected with the virus of 
interest. Those in the incubation phase or the acute phase of 
infection can be treated with the immunogenic peptides 
separately or in conjunction with other treatments, as 
appropriate. In therapeutic applications, compositions are 
administered to a patient in an amount sufficient to elicit an 
effective CTL response to the virus or tumor antigen and to 
cure or at least partially arrest symptoms and/or 
complications. An amount adequate to accomplish this is 
defined as "therapeutically effective dose." Amounts 
effective for this use will depend on, e.g., the peptide 
composition, the manner of administration, the stage and 
severity of the disease being treated, the weight and general 
state of health of the patient, and the judgment of the 
prescribing physician, but generally range for the initial 
immunization (that ia for therapeutic or prophylactic 
administration) from about 1.0 /ig to about 5000 of peptide 
for a 70 kg patient, followed by boosting dosages of from 
about 1.0 ng to about lOOO /xg of peptide pursuant to a 
boosting regimen over weeks to months depending upon the 
patient's response and condition by measuring specific CTL 
activity in the patient's blood. It must be kept in mind that, 
the peptides and compositions of the present invention may 
generally be employed in serious disease states, that is, 
life- threatening or potentially life threatening situations. 
In such cases, in view of the m.inimization of extraneous 
substances and the r-iative nc--3xic nature of the peptides, 
it is possible and may be felt desirable by the treating 
physician to administer substantial excesses of these peptide 
compositions . 

For therapeutic use, administration should begin at 
the first sign of viral infection or the detection or surgical 
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removal of cumors or shorcly after diagnosis in the case of 
acure infection. This is followed by boosting doses until at 
least symptoms are substantially abated and for a period 
thereafter. In chronic infection, loading doses followed by 
5 boosting doses may be required. 

Treatment of an infected individual with the 
compositions of the invention may hasten resolution, of the 
infection in acutely infected individuals. For those 
individuals susceptible (or predisposed) to developing chronic 

10 infection the compositions are particularly useful in methods 
for preventing the evolution from acute to chronic infection. 
Where the susceptible individuals are identified prior to or 
during infection, for instance, as described herein, the 
composition can be targeted to them, minimizing need for 

15 administration to a larger population. 

The peptide compositions can also be used for the 
treatment of chronic infection and to stimulate the immune 
system to eliminate virus- infected cells in carriers. It is 
important to provide an amount of immuno- potentiating peptide 

20 in a formulation and mode of administration sufficient to 

effectively stimulate a cytotoxic T cell response. Thus, for 
treatment of chronic infection, a representative dose is in 
the range of about 1.0 to about 5000 ^g, preferably about 5 
^g to 1000 /xg for a "^/O )cg patient per dose. Immunizing doses 

25 followed by boosting doses at established intervals, e.g., 
from one to four weelcs, may be required, possibly for a 
prolonged period of time to effectively immunize an 
individual. In the case of chronic infection, administration 
should continue until at least clinical symptoms or laboratory 

3 0 tests indicate that the viral infection has been eliminated or 
substantially abated and for a period thereafter. 

The pharmaceutical compositions for therapeutic 
treatment are intended for parenteral, topical, oral or local 
administration. Preferably, the pharmaceutical compositions 

35 are administered parenterally , e.g., intravenously, 

subcutaneously, intradermally , or intramuscularly. Thus, the 
invention provides compositions for parenteral administration 
which comprise a solution of the immunogenic peptides 
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dissolved or suspended in an acceptable carrier, preferably an 
aqueous carrier. A variety of aqueous carriers may be used, 
e.g., water, buffered water, 0.8% saline, 0.3% glycine, 
hyaluronic acid and the like. These ccrpositions may be 
sterilized by conventional, well Jcnown sterilization 
techniques, or may be sterile filtered. The resulting aqueous 
solutions may be packaged for use as is, or lyophilized, the 
lyophilized preparation being combined with a sterile solution 
prior to administration. The compositions may contain 
pharmaceutically acceptable auxiliary substances as required 
to approximate physiological conditions, such as pH adjusting 
and buffering agents, tonicity adjusting agents, wetting 
agents and the like, for example, sodium acetate, sodium 
lactate, sodium chloride, potassium chloride, calcium 
chloride, sorbitan monolaurate, triethanolamine oleate, etc. 

The concentration of CTL stimulatory peptides of the 
invention in the pharmaceutical formulations can vary widely, 
i.e.. from less than about 0.1%, usually at or at least about 
2% to as much as 20% to 50% or more by weight, and will be 
selected primarily by fluid volumes, viscosities, etc.. in 
accordance with the particular mode of administration 
selected. 

The peptides of the invention may also be administered 
-iposomes, which serve to target the peptides to a 
particular tissue, such as lymphoid tissue, or targeted 
selectively to infected cells, as well as increase the half- 
life of the peptide composition. Liposomes include emulsions, 
foams, micelles, insoluble monolayers, liquid crystals, 
phosphr^ .-id dispersions, lamellar layers and the like. In 
tiiese .aparations the peptide- to be delivered is incorporated 
as ..„-t of a liposome, alone or in conjunction with a molecule 
which binds to, e.g., a receptor prevalent among lymphoid 
cells, such as monoclonal antibodies which bind to the CD45 
antigen, or with other therapeutic or immunogenic 
compositions. Thus, liposomes either filled or decorated with 
a desired peptide of the invention can be directed to the site 
of lymphoid cells, where the liposomes then deliver the 
selected therapeutic /immunogenic peptide compositions. 
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Liposomes for use in nhe invention are farmed from Suandard 
vesicle- forming lipids, which generally include neucral and 
negatively charged phospholipids and a scercl, such as 
cholesterol. The selection of lipids is generally guided by 
5 consideration of, e.g., liposome size, acid lability and 

stability of the liposomes in the blood stream, A variety of 
methods are available for preparing liposomes, as described 
in, e.g., SzoJca et al., Ann. Rev. Bioohvs . Sioena . 9:467 
(1980), U.S. Patent Nos. 4,235,871. 4,501,728, 4,837,028, and 

10 5,019,3 69, incorporated herein by reference. 

For targeting to the immune cells, a ligand to be 
incorporated into the liposome can include, e.g., antibodies 
or fragments thereof specific for cell surface determinants of 
the desired immune system cells. A liposome suspension 

15 containing a peptide may be administered intravenously, 

locally, topically, etc. in a dose which varies according to, 
inter alia , the manner of administration, the peptide being 
delivered, and the stage of the disease being treated. 

For solid compositions, conventional nontoxic solid 

20 carriers may be used which include, for example, 

pharmaceutical grades of mannitol, lactose, anarch, zhagnesium 
stearate, sodium saccharin, talcum, cellulose, glucose, 
sucrose, magnesium carbonate, and the like. For oral 
administration, a pharmaceutically accepnable nontoxic 

25 composition is formed by incorporating any of the normally 

employed excipients, such as those carriers previously listed, 
and generally 10-95% of active ingredient., that is, one or 
more peptides of the invention, and more preferably at a 
concentration of 25% -75%. 

30 For aerosol administration, the immunogenic peptides 

are preferably supplied in finely divided form along with a 
surfactant and propellant. Typical percentages of peptides 
are 0.01% -20% by weight, preferably 1%-10%. The surfactant 
must, of course, be nontoxic, and preferably soluble in the 

35 propellant. Representative of such agents are the- -esters or 
partial esters of fatty acids containing from 6 to 22 carbon 
atoms, such as caproic, octanoic, lauric, palmitic, stearic, 
linoleic, linolenic, olesteric and . oleic acids with an 
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aliphacic polyhydric alcohol or its cyclic anhydride. Mixed 
esters, such as mixed or natural glyceridss may be employed. 
The surfactant may constitute 0.1% -2 0% by weight of the 
composition, preferably 0.25-5%. The balance of the 
composition is ordinarily propellant . A carrier can also be 
included, as desired, as with, e.g., lecithin for intranasal 
delivery. 

In another aspect the present invention is directed to 
vaccines which contain as an active ingredient an 
immunogenically effective amount of an immunogenic peptide as 
described herein. The peptide (s) may be introduced into a 
host, including humans, linked to its own carrier or as a 
homopolymer or heceropolymer of active peptide units. Such a 
polymer has the advantage of increased immunological reacticn 
and, where different peptides are used to make up the polymer, 
the additional ability to induce antibodies and/or CTLs that 
react with different antigenic determinants of the virus or 
tumor cells. Useful carriers are well known in the art, and 
include, e.g., thyroglobulin, albumins such as human serum 
albumin, tetanus toxoid, polyamino acids such as 
poly (lysine: glutamic acid), influenza, hepatitis B virus core 
protein, hepatitis B virus recombinant vaccine and the like. 
The vaccines can also contain a physiologically tolerable 
(acceptable) diluent such as water, phosphate buffered saline, 
or saline, and further typically include an adjuvant. 
Adjuvants such as incomplete Freund's adjuvant, aluminum 
phosphate, aluminum hydroxide, or alum are materials well 
known in the art. And. as mentioned above, CTL responses can 
be primed by conjugating peptides of the invention to lipids, 
such as P3CSS. Upon immunization with a peptide composition' 
as described herein, via injection, aerosol, oral, transdermal 
or other route, the immune system of the host responds to the 
vaccine by producing large amounts of CTLs specific for the 
desired antigen, and the host becomes at least partially 
immune to later infection, or resistant to developing chronic 
infection. 

vaccine compositions containing the peptides of the 
invention are administered to a patient susceptible to or 
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ochsrwise az risk of virai infection c-r cancer zo eiicir an 
iimnune response against: zhe ancigen and zhus ennance the 
pacienc ' s own iinmune response capabilities . Such an amount is 
defined to be an " immunogenically effective dose." In this 
5 use, the precise amounts again depend on the patient's state 
of health and weight, the mode of administration, the nature 
of the formulation, etc., but generally range from about 1,0 
fig to about 5000 fxg per 70 kilogram patient, more commonly 
from about 10 /xg to about 500 /xg mg per 70 kg of body weight. 

10 In some instances it may be desirable to combine the 

peptide vaccines of the invention with vaccines which induce 
neutralizing antibody responses to the virus of interest, 
particularly to viral envelope antigens. 

For therapeutic or immunization purposes, the peptides 

15 of the invention can also be expressed by attenuated viral 
hosts, such as vaccinia or fowlpox. This approach involves 
the use of vaccinia virus as a vector to express nucleotide 
sequences that encode the peptides of the invention. Upon 
introduction into an acutely or chronically infected host or 

20 into a non- infected host, the recombinant vaccinia virus 

expresses the immunogenic peptide, and thereby elicits a host 
CTL response. Vaccinia vectors and methods useful in 
immunization protocols are described in, e.g., U.S. Patent No. 
4,722,848, incorporated herein by reference. Another vector. 

25 is BCG (Bacille Calmette Guerin) . BCG vectors are described 
in Stover et al . ( Nature 351:456-460 (1991)) which is 
incorporated herein by reference. A wide variety of other 
vectors useful for therapeutic administration or immunization 
of the peptides of the invention, e.g.. Salmonella typhi 

3 0 vectors and the like, will be apparent to those skilled in the 
art from the description herein. 

Antigenic peptides may be used to elicit CTL ex vivo , 
as well. The resulting CTL, can be used to treat chronic 
infections (viral or bacterial) or tumors in patients that do 

35 not respond to other conventional forms of therapy, or will 

not respond to a peptide vaccine approach of therapy. Ex vivo 
CTL responses to a particular pathogen (infectious agent or 
tumor antigen) are induced by incubating in tissue culture the 
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pacienc's CTL precursor ceils (CTLp) rogecher with a source of 
antigen- presenting ceils (APC) and the appropriate immunogenic 
peptide. After an appropriate incubation time (typically 1-4 
weeks), in which the CTLp are activated ' and mature and expand 
5 into effector CTL, the ceils are infused back into the 

patient, where they will destroy their specific target cell 
(an infected cell or a tumor cell) . 

The peptides may also find use as diagnostic reagents. 
For example, a peptide of the invention may be used to 

10 determine the susceptibility of a particular individual to a 
treatment regimen which employs the peptide or related 
peptides, and thus may be helpful in modifying an existing 
treatment protocol or in determining a prognosis for an 
affected individual. In addition, the peptides may also be 

15 used to predict which individuals will be at substantial risk 
for developing chronic infection. 

The following examples are offered by way of 
illustration, not by way of limitation. 
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£xamr:.Le 1 
Class I antiaen isolanicn 
A flow diagram of an KLA-A annigen purification scheme 
is presenred in Figure 1. Briefly, che cells bearing che 
5 appropriace allele were grown in large bacches (6-8 licers 
yielding -5 x 10^. cells) , harvested by cenrrif ugacion and 
washed. All cell lines were maintained in RPMI 1640 media 
(Sigma) supplemented with 10% fetal bovine serum (FBS) and 
antibiotics. For large-scale cultures, cells were grown in 

10 roller bottle culture in RPMI 1640 with 10% F3S . or with 10% 
horse serum and antibiotics. Cells were harvested by 
centrifugation at 1500 RPM IEC-CRU5000 centrifuge with 259 
rotor and washed three times with phosphate-buf f ered saline 
(PBS) (0.01 M PO4, 0.154 M NaCl , pH 7.2) . 

15 Cells were pelleted and stored at -70°C or treated 

with detergent lysing solution to prepare detergent lysates . 
Cell lysates were prepared by the addition of stock detergent 
solution [1% NP-40 (Sigma) or Renex 30 (Accurate Chem. Sci. 
Corp., Westbury, NY 11590), 150 mM NaCl, 50 mM Tris , pH 8*0] 

20 to the cell pellets (previously counted) at a ratio of 50-100 
X 10^ cells per ml detergent solution. A cocktail of protease 
inhibitors was added to the premeasured volume of stock 
detergent solution immediately prior to the addition to the 
cell pellet. Addition of the protease inhibitor cocktail 

25 produced final concentrations of the following: 

phenylmethylsulf onyl fluoride (PMSF) , 2 mM; aprotinin, 5 
/xg/ml; leupeptin, 10 /ig/ml; pepstatin, 10 ^g/ml; 
iodoacet amide, 100 fjM; ajid EDTA, 3 ng/ml . Cell lysis was 
allowed to proceed at 4*»C for 1 hour with periodic mixing. 

30 Routinely 5-10 x 10^ cells were lysed in 50-100 ml of 
detergent solution. The lysate was clarified by 
centrifugation at 15,000 x g for 30 minutes at 4®C and 
subsequent passage of the supernatant fraction through a 0.2 
filter unit (Nalgene) . 

35 The HLA-A antigen purification was achieved using 

affinity columns prepared with mAb- conjugated Sepharose beads. 
For antibody production, cells were grown in RPMI with 10% FBS 
in large tissue culture flasks (Coming 25160-225) . 



wo 94/20U7 



PCTA.S94/02353 



Antibodies were purified from ciarified rissue culture medium 
by ammoniuin sulfate fractionation followed by affinity 
chromatography on prctein-A-Sepharcse (Sigma) . Briefly, 
saturated ammonium sulfate was added slowly with stirrina to 
5 the tissue culture supernatant to 45% {volume to volume) 
overnight at 4*'C to precipitate the immunoglobulins. The 
precipitated proteins were harvested by centrif ugation at 
10,000 X g for 30 minutes. The precipitate was then dissolved 
in a minimum volume cf PBS and transferred to. dialysis tubing 

10 (Spectro/Por 2, Mol. wt. cutoff 12,000-14,000, Spectum Medical 
Ind.). Dialysis was against PBS (a20 times the protein 
solution volume) with 4-6 changes of dialysis buffer over a 
24-48 hour period at 4«C. The. dialyzed protein solution was 
clarified by centrifugation {10,000 x g for 30 minutes) and 

15 the pH of the solution adjusted to pH 8.0 with IN NaOH. 

Protein- A- Sepharose (Sigma) was hydrated according to the 
manufacturer's instructions, and a protein- A- Sepharose column 
was prepared. A column of 10 ml bed volume typically binds 
50-100 mg of mouse IgG. 

20 The protein sample was loaded onto the protein-A- 

Sepharose column using a peristaltic pump for large loading 
volumes or by gravity for smaller volumes {<100 ml) . The 
column was washed with several volumes of PBS, cuid the eluate 
was monitored at A2 80 in a spectrophotometer until base line 

25 was reached. The bound antibody was eluted using 0.1 M citric 
acid at suitable pH (adjusted to the appropriate pH with IN 
NaOH) . For mouse IgG-1 pH 6.5 was used for IgG2a pH 4.5 was 
used and for IgG2b and IgGB pH 3.0 was used. 2 M Tris base 
was used to neutralize the eluate. Fractions containing the 

30 antibody (monitored by A280) were pooled, dialyzed against PBS 
and further concentrated using an Amicon Stirred Cell system 
(Amicon Model 8050 with YM30 membrane) . The anti-A2 mAb, 
BB7.2, was useful for affinity purification. 

The HLA-A antigen was purified using affinity columns 

35 prepared with mAb -conjugated Sepharose beads. The affinity 

colTjmns were prepared by incubating protein- A- Sepharose beads 
(Sigma) with affinity-purified mAb as described above. Five 
to 10 mg of mAb per ml of bead is the preferred ratio. The 



wo 94/20127 



PCT/ijSV4/023ij 



3 1 

ruAb cound beads were washed wich boraue buffer (borate buffer: 
100 mM sodium tecraborace , 154 mM NaCl , pH 8.2) unnil zhe 
washes show A2 8 0 ac based line. Dimechyl pimeiiniidace (20 mM) 
in 200 mM triechanolamine was added to ccvaiently crosslink 
5 Che bound mAb to the protein- A-Sepharose (Schneider et ai . , J. 
Riol. Chem. 257:10766 (1982), After incubation for 45 minutes 
at room temperature on a rotator, the excess crosslinking 
reagent was removed by washing the beads twice with 10-20 ml 
of 20 mM ethanolamine , pH 8.2, Between each one the slurry 

10 was placed on a rotator for 5 minutes at. room temperature. 
The beads were washed with borate buffer and with PBS plus 
0.02% sodium azide. 

The cell lysate (5-10 x 10^ cell equivalents) was then 
slowly passed over a 5-10 ml affinity column (flow rate of 

15 0.1-0.25 ml per minute) to allow the binding of the antigen to 
the immobilized antibody. After the lysate was allowed to 
pass through the column, the column was washed sequentially 
with 20 column volumes of detergent stock solution plus 0,1% 
sodium dodecyl sulfate, 20 column volumes of 0.5 M NaCl, 20 mM 

20 TriSr pH 8,0, and 10 column volumes of 20 mM Tris, pH 8.0. 

The HLA-A antigen bo\xnd to the mAb was eluated with a basic 
buffer solution (50 mM diethylamine in water) . As an 
alternative, acid solutions such as 0.15-0.25 M acetic acid 
were also used to elute the bound antigen. An alicjuot of the 

25 eluate (1/50) was removed for protein quantification using 
either a colorimetric assay (EGA assay, Pierce) or by SDS- 
PAGE, or both. SDS-PAGE analysis was performed as described 
by Laemmli (Laemmli, U.K., Nature 227:680 (1970)) using known 
amounts of bovine serum albumin (Sigma) as a protein standard. 

3 0 Allele specific antibodies were used to purify the specific 

MHC molecule. In the case of HLA-A2, the mAb BB7.2 was used. 

Example 2 

Isolation and sequencing of naturally processed peptides 
3 5 For the HLA-A preparations derived from the base (50 

mM diethylamine) elution protocol, the eluate was immediately 
neutralized with l N acetic acid to pH 7.0-7.5. The 
neutralized eluate was concentrated to a volume of 1-2 ml in 
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an Amiccn scirred ceii [Modsi 8050, wicr. YM3 membranes 
(Amicon) ] . Ten ml of ammonium acecace ;0.0l M, pH 8,0) was 
added co the concencracor co remove che non-volatile salts, 
and the sample was concentrated to approximately 1 ml. a 
small sample (1/50) was removed for protein quantitation as 
described above. The remainder was recovered into a 15 ml 
polypropylene conical centrifuge tube (Falcon, 2097) (Becton 
Dickinson) . Glacial acetic acid was added to obtain a final 
concentration of 10% acetic acid. The acidified sample was 
placed in a boiling water bath for 5 minutes to allow for the 
dissociation of the bound peptides. The sample was cooled on 
ice, returned to che concentrator and the filtrate was 
collected. Additional aliquots of 10% acetic acid (1-2 ml) 
were added to the concentrator, and this filtrate was pooled 
with the original filtrate. Finally, 1-2 ml of distilled 
water was added to the concentrator, and this filtrate was 
pooled as well. 

The retentate contains the bulk of the HLA-a heavy 
chain and /Sj -microglobulin, while the filtrate contains the 
naturally processed bound peptides and other components with 
molecular weights less than about 3000. The pooled filtrate 
material was lyophilized in order to concentrate the peptide 
fracrion. The sample was then ready for further analysis. 

For HPLC (high performance liquid chromatography) 
separation of the peptide fractions, the lyophilized sample 
was dissolved in 50 nl of distilled water, or into 0.1% 
trifluoracetic acid (TFA) (Applied Biosystems) in water and 
injected to a C18 reverse-phase narrow bore column (Beckman 
Cia Ultrasphere, 10 x 250 mm), using a gradient system 
described by Stone and Williams (Stone, K.L. and Williams 
K.R., in, Macromolecular Sequencing and Synthesis; Selected 
Methods and Applications, A.R. Liss, New York, 1988, pp. 7-24. 
Buffer A was 0.06% TFA in water (Burdick- Jackson) and buffer 3 
was 0.052% TFA in 80% acetonitrile ( Burdick -Jacks on ) . The 
flow rate was 0.250 ml/minute with the following gradient: 0- 
60 min., 2-37.5% B; 60-95 min. , 37.5-75% B; 95-105 min. , 75- 
98% B. The Gilson narrow bore HPLC configuration is 
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parcicuiariy useful for chis purpose, aitr.ough ccher 
conf iguranions work equally well. 

A large number of peaks were dececced by absorbance az 
214 rm, majiy of which appear no be of low abundance. Whecher 
5 a given peak represents a single peptide cr a peptide rr.ixcure 
was not: determined. Pooled fractions were then sequenced to 
determine motifs specific for each allele as described below. 

Pooled peptide fractions, prepared as described above 
were analyzed by automated Edman sequencing using the Applied 
10 Biosystems Model 477A automated sequencer. The sequencing 
method is based on the technique developed by Pehr Edman in 
the 1950s, for the sequential degradation of proteins and 
peptides to determine the sequence of the constituent amino 
acids . 

15 The protein or peptide to be sequenced was held by a 

12 -mm diameter porous glass fiber filter disk in a heated, 
argon-purged reaction chamber. The filter was generally pre- 
treated with BioBrene Plus™ and then cycled through one or 
more repetitions of the Edman reaction to reduce contaminajats 

20 and improve the efficiency of subsequent sample sequencing. 

Following the pre- treatment of the filter, a solution of the 
sample protein or peptide (10 pmol-5 nmol range) was loaded 
onto the glass filter and dried. Thus, the sample was left 
embedded in the film of the pre- treated disk. Covalent 

25 attachment of the sample to the filter was usually not 

necessary because the Edman chemistry utilized relatively 
apolar solvents, in which proteins and peptides are poorly 
soluble. 

Briefly, the Edman degradation reaction has three 
30 steps: coupling, cleavage, and conversion. In coupling step, 
phenylisothiocyanate (PITC) is added. The PITC reacts 
quantitatively with the free amino- terminal amino acid of the 
protein to form the phenylthiocarbamyl- protein in a basic 
environment. After a period of time for the coupling step, 
35 the excess chemicals are extracted and the highly volatile 

organic acid, trif luoroacetic acid, TFA, is used to cleave the 
PITC- coupled amino acid residue from the amino terminus of the 
protein yielding the anilinothiazolinone (ATZ) derivative of 
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Che amino acid. The rernaining procein/pepciae is left wich a 
new amino terminus and is ready for the next Edman cycle. The 
ATZ amino acid is extracted and transferred to a conversion 
flask, where upon addition of 25% TFA in water, the ATZ amino 
acid is converted to the more' stable phenyl thiohydantoin (PTH) 
amino acid that can be identified and quantified followino 
automatic injection into the Model 120 PTH Analyzer which"uses 
a microbore C-18 reverse -phase HPLC column for the analysis. 

In the present procedures, peptide mixtures were 
loaded onto the glass filters. Thus, a single amino acid 
sequence usually does not result. Rather, mixtures of amino 
acids in different yield are found. When the particular 
residue is conserved among the peptides being sequenced, 
increased yield for that amino acid is observed. 

Example 3 

Definition of an A2 . 1 snecific mntj- ^if 
In one case, pooled peptide fractions prepared as 
described in Example 2 above were obtained from HIiA-A2.l 
homozygous cell lines, for example, JY. The pooled fractions 
were HPLC fractions corresponding to 7% to 45% CH3CN. For 
this class I molecule, this region of the chromacogram was 
most abundant in peptides. Data from independent experiments 
were averaged as described below. 

The amino acid sequence analyses from four independent 
experiments were analyzed and the results are shown in Table 
3. For each position except the first, the data were analyzed 
by modifying the method described by Falk et al., supra , to 
allow for comparison of experiments from different HLA types. 
This modified procedure yielded quantitative yet standardized 
values while allowing the averaging of data from different 
experiments involving the same HLA type. 

The raw sequenator data was converted to a simple 
matrix of 10 rows (each representing one Edman degradation 
cycle) and 16 columns (each representing one of the twenty 
amino acids; W, C, R and H were eliminated for technical 
reasons. The data corresponding to the first row (first 
cycle) was not considered further because, this cycle is 
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usually heavily concaininaced by free amino acids.) . The 
values of each row were suinmed to yield a coral pmoles value 
for thac parricular cycle. For each row, values for each 
amino acid were then divided by the corresponding total yield 
value, to determine what fraction of the total signal is 
attributable to each amino acid at each cycle. By doing so, 
an "Absolute Frequency" table was generated. This absolute 
frequency table allows correction for the declining yields of 
each cycle. 
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Starring from che - absoiuce frequency cajDie, a 
"relative frequency" table was then generated to allow 
comparisons among different amino acids. To do so the data 
from each column was summed, and then averaged. Then, each 
value was divided next by the average column value to obtain 
relative frequency values. These values quantitate, in a 
standardized manner, increases and decreases per cycle, for 
each of the different sixteen amino acid types. Tables 
generated from data from different experiments can thus be 
added together to generate average relative frequency values 
(and their standard deviations) . All standard deviations can 
then be averaged, to estimate a standard deviation value 
applicable to the samples from each table. Any particular 
value exceeding 1.00 by more than two standard deviations is 
considered to correspond to a significant increase. 

Example 4 
Quantitative Binding Assays 
Using isolated MHC molecules prepared as described in 
20 Example 2, above, quantitative binding assays were performed. 
Briefly, indicated amoxints of MHC as isolated above were 
incubated in 0.05% NP40-PBS with *5 nM of radiolabeled 
peptides in the presence of 1-3 /xM jSjM and a cocktail of 
protease inhibitors (final concentrations 1 mM PMSF, 1.3 mM 
25 1-10 Phenanthroline, 73 /xM Pepscatin A, 8 mM EDTA, 200 /xM N-o;- 
p-tosyl-L- Lysine cailoromethyl ketone) . After various times, 
free and bound peptides were separated by TSK 2000 gel 
filtration, as described previously in A. Sette et al., J. 
Immunol . 148:844 (1992), which is incorporated herein by 
30 reference. Peptides were labeled by the use of the Chloramine 
T method Buus et al.. Science 235:1352 (1987), which is 
incorporated herein by reference. 

The HBc 18-27 peptide HLA binding peptide was 
radiolabeled and offered (5-10 nM) to 1 fjM purified HLA A2.1. 
35 After two days at 23 °C in presence of a cocktail of protease 
inhibitors and 1-3 ;iM purified human 02^' percent of MHC 

class I bound radioactivity was measured by size exclusion 
chromatography, as previously described for class II peptide 



10 
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binding assays in Secte ec ai . , m Seminars in Immune 1 cicrv . 
Vol. 3, Gefter, ed. (W.E. Saunders, Philadelphia, 1991), pn 
195-202, which is incorporated herein by reference. Using 
chis protocol, high binding (95%) was detected in all cases in 
5 the presence of purified HLA A2.1 molecules. 

To explore the specificity of binding, we determined 
whether the binding was inhibitable by excess unlabeled 
peptide, and if so, what the 50% inhibitory concentration 
(IC50%) might be. The rationale for this experiment was 

10 threefold. First, such an experiment is crucial in order to 
demonstrate specificity. Second, a sensitive inhibition assay 
is the most viable alternative for a high throughput 
quantitative binding assay. 'Third, inhibition data svibjected 
to Scatchard analysis can give quantitative estimates of the 

15 equilibrium constant (K) of interaction and the fraction of 
receptor molecules capable of binding ligand (% occupancy) . 
For instance, in analysis of an inhibition curve for the 
interaction of the peptide HBc "18-27 with A2.1, the IC50% was 
determined to be 25 nM. Further experiments were conducted to 

20 obtain Scatchard plots. For HBc 18-27/A2.1, six different 

experiments using six independent MHC preparations yielded a 
Kp of 15.5 ± 9,9 X 10'^ M and occupancy values of 6.2% (±1.4) . 

Several reports have demonstrated that class I 
molecules, unlike class II, are highly selective with regard 

25 to the size of the peptide epitope that they recognize. The 
optimal size varies between 8 and 10 residues for different 
peptides and different class I molecules, although MHC binding 
peptides as long as 13 residues have been identified. To 
verify the stringent size requirement, a series of N- and 

30 C- terminal trxincation/extension analogs of the peptide HBc 

18-27 were synthesized and tested for A2.1 binding. Previous 
studies had demonstrated that the optimal size for CTL 
recognition of this peptide was the 10-mer HBcl8-27 (Sette et 
al. supra) . It was found thar removal or addition of a 

35 residue at the C terminus of the molecule resulted in a 30 to 
100 -fold decrease in binding capacity. Further removal or 
addition of another residue completely obliterated binding. 
Similarly, at the N- terminus of the molecule, removal or 
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deiecion of one residue from che cpninial HHc 18-27 peptide 
compieceiy 5±)rogaced A2 . 1 binding. 

Throughouc this disclosure, results have been 
expressed in terms of IC50*s. Given the conditions in which 
5 our assays are run (i.e., limiting MHC and labeled peptide 
concentrations), these values approximate values. It 
should be noted that IC50 values can change, often 
dramatically, if the assay conditions are varied, and 
depending on the particular reagents used (e.g., Class I 
10 preparation, etc.). For example, excessive concentrations of 
MHC will increase the apparent measured IC50 of a given 
ligand. 

An alternative way of expressing the binding data, to 
avoid these uncertainties, is as a relative value to a 

15 reference peptide. The reference peptide is included in every 
assay. As a particular assay becomes more, or less, 
sensitive, the ICSO's of the peptides tested may change 
somewhat. However, the binding relative to the reference 
peptide will not change. For example, in an assay run under 

20 conditions such that the ICSO of the reference peptide 
increases 10 -fold, all ICSO values will also shift 
approximately ten- fold. Therefore, to avoid ambiguities, the 
assessment of whether a peptide is a good, intermediate, weak, 
or negative binder should be based on it's ICSO, relative to 

25 the ICSO of the standard peptide. 

The reference peptide for the HIiA-A2.1 assays 
described herein is referred to as 941.01 having a sequence of 
FLPSDYFPSV. An average ICSO of 5 (nM) was observed under the 
assay conditions utilized. 

30 If the ICSO of the standard peptide measured in a 

particular assay is different from that reported in the table, 
then it should be understood that the threshold values used to 
determine good, intermediate, weak, and negative binders 
should be modified by a corresponding factor. For exan^sle, if 

35 in an A2.1 binding assay, the ICSO of the A2 . 1 standard 

(941-01) were to be measured as 8 nM instead of 5 nM, then a 
peptide ligand would be called a good binder only if it had an 
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IC50 Of less ciian SO nM ;i.e., 8nM x O.i; , instead of rhe 
usual cur -off value of 50 nM. 

Example 5 



HLA-A2.1 3ir.dir!a Motif and Alaorighm 
The structural requirements for peptide bindino to 
A2.1 have been defined for both, 9-mer and 10-mer peptides. 
Two approaches have been used. The first approach referred to 
as the "?oly-A approach" uses a panel of single amino acid 
substitutions of a 9-mer prototype poly- A binder (ALAKAAAAV) 
that is tested for A2.1 binding using the methods of Example 4 
above to examine the degree of degeneracy of the anchor- 
positions and the possible influence of non-anchor positions 
on A2.1 binding. 

The second approach, the "Motif -Library approach", 
uses a large library of peptides selected from sequences of 
potential target molecules of viral and tumor origin and 
tested for A2.1 binding using the methods in Exan^jle 4 above. 
The frequencies by which different amino -acids occured at each 
position in good binders and non- binders were analysed to 
further define the role of non-anchor positions in g-mers and 
lO-mers. 

A2.1 binding of pentidp o-mBr-e 

PQly A Ajfpxn^rti A poly- A 9-mer peptide, containing 
the A2.1 motif L (Leu) in position 2 and V (Val) in position 9 
was chosen as a prototype binder. A K (Lys) residue was 
included in position 4 to increase solubility. A panel of 91 
single amino -acid substitution analogues of the prototype 
parental 9-mer was synthesized and tested for A2.1 binding 
(Table 4) . Shaded areas mark analogs with a greater than 10- 
fold reduction in binding capacity relative to the parental 
peptide. A reduction in binding greater than 100 -fold is 
indicated by hyphenation. 

Anchor- Positinnc; ? and Q -in p oly- a Analofye The 

effect of single-amino-acid substitutions at the anchor 
positions 2 and 9 was examined first. Most substitutions in 
these positions had profound detrimental effects on binding 
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capacity, thus confirming their role fcr hinding. More 
specifically, in position 2 only L and M bound within a 10- 
fold range ("preferred residues"). Residues with similar 
characteristics, such as I, V, A, and T were tolerated, but 
5 bound 10 to 100 -fold less strongly than the parental peptide. 
All the remaining substitutions (residues S, N, D, F, C, K, G, 
and F) were not tolerated and decreased binding by more than 
100-f old. ' Coirtparably stringent requirements were observed for 
position 9, where V, L and I were preferred and A and M are 
10 tolerated, while the residues T, C, N, F, and Y virtually 
abolished binding. According to this set of peptides, an 
optimal 2-9 motif could be defined with L, M in position 2 and 
V, I, or L in position 9. 
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Non- Anchor Posicicns 1 and 3-8 in oolv-A Analogs Al 
non-anchor positions were more permissive co different 
substitutions than the anchor-positions 2 and 9, i.e most 
residues were tolerated. Significant decreases in binding 
were observed for some substitutions in distinct positions. 
More specifically, in position 1 a negative charge (residues ] 
and E) or a P greatly reduced the binding capacity. Most 
substitutions were tolerated in position 3 with the exception 
of the residue K. Significant decreases were also seen in 
position 6 upon introduction of either a negative charge (D, 
E) or a positively charged residue (R) . A stiinmary of these 
effects by different single amino acid substitutions is given 
in Table 5. 
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TABLE 5 

Summary A2.1 Poly- A 



15 



AA position 


( + ) 


(+/-) 


(-) 


1 


FAYKVGSIT 




EDP 


2 


LM 


VITA 


SNDFCKG? 


3 


AFDEMYLSNPV 


K 




4 


CEVPATSD 






5 


NALYGEDKQ 






6 


FIAPCVYEG 


DR 




7 


YANLPVETQ 






8 


.UiGPFYQTNVEHK 






9 


VIL 


AM 


TCNFY 



Ratio > 0.1 Ratio 0.01-0.1 Ratio < O.Ol 



The Motif -Library Approach To further evaluate the 
importance of non-anchor positions for binding, peptides of 
potential target molecules of viral and tximor origin were 
scanned for the presence of sequences containing optimal 2-9 
anchor motifs. A set of 161 peptides containing a L or M in 
position 2 and a V, L or I in position 9 was selected, 
synthesized and tested for binding (see Example 6). Only 
11.8% of these peptides bind with high affinity (ratio aO.iO; 
22.4% were intermediate binders (ratio aO.l) , As many as 36% 
were weak binders (ratio <0.0l - 0.0001), and 31% were non- 
binders (ratio<0.000l) . The high number of non-binders 
containing optimal anchor-motifs indicates that in this set of 
peptides positions other than the 2-9 anchors influence A2.1 
binding capacity. Appendix 1 sets forth all of the peptides 
having the 2-9 motif used for this analysis and the binding 
data for those peptides. 

To define the influence on non- anchor positions more 
specifically, the frequency of occurrence of each amino acid 
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in each cf the non- anchor positions was caiculared for rhe 
good and intermediace binders on one hand and non -binders on 
Che other hand. Amino acids of similar chemical 
characceriscic were grouped together . Weak binders were not 
5 considered for the following analysis. The frequency of 
occurence of each amino acid in each of the non- anchor 
positions was calculated for the good binders and non-binders 
(Table 6) . 

Several striking trends become apparent. For example 
10 in position 1, only 3.6% of the A2 . 1 binders and as much as 
35% of the non-binders carried a negative charge (residues D 
' and E) . This observation correlates well with previous 
findings in the set of poly- A analogs, where a D or E 
substitution greatly affected binding. Similarly, the residue 
15 P was 8 times more frequent in non- binders than in good 

binders- Conversely, the frequencies of aromatic residues (Y, 
F, W) were greatly increased in A2,l binders as compared to 
non -binders. 
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Following chis approach, amino acids of siniilar 
scrucrural characterisnics were grouped zogecher. Then, rhe 
frequency of each amino acid group in each position was 
calculated for binders versus non-binders (Table 7) . Finally, 
Che frequency in Che binders group was divided by the 
frequency in the non-binders co obtain a "frequency ratio". 
This ratio indicates whether a given amino -acid or group of 
residues occurs in a given position preferentially in good 
binders (ratio >1) or in non- binders (ratio <1) . 

TABLE 7 
A2.1 9-mer PEPTIDES 
NUMBER OF PEPTIDES 161 

GOOD BINDERS 19 11.8V 

INTERMEDIATE BINDERS 3 6 22.4% 

WEAK BINDERS 58 3 6.0% 

NON-BINDERS 48 23.8% 





P08. 1 


po6 . 2 


pos. 3 


pos . 4 


pes . 5 


pes . 6 


pos . 7 


pos . 8 


pes . 9 




ratio 


ratio 


racio 


ratio 


ratio 


ratio 


ratio 


ratio 


ratio 


A 


2.6 


NA 


0,9 


0.9 


0.7 


0.9 


4 .4 


0,3 


NA 


6 


3-5 


NA 


0.4 


1.1 


1.1 


1.3 


0.4 


0.4 


NA 


D,E 


0.1 


NA 


0.0 


0.7 


0.3 


0.7 


0,1 


0.9 


NA 


R,H,K 


3.1 


NA 


0.2 


1.0 


0.9 


0.1 


0.0 


1.3 


NA 


L, V, I,M 


3.1 


1.0 


1.8 


0,5 


0.9 


1.3 


1.2 


1.7 


1.0 


Y,F,W 


7.0 


NA 


5.2 


0.9 


8.7 


2.0 


2.3 


2.6 


NA 


Q.N 


0.5" 


NA 


0.4 


1,2 


0.9 


1.0 


0.7 


0.3 


NA 


S,T,C 


0.7 


NA 


1,9 


4.8 


0.9 


1.2 


1.2 


1.1 


NA 


P 


0.1 


NA 


0.7 


0.7 


2.6 


1.7 


2.9 




NA 



indicaces that there were no negative binders 



Different Residues Influence A2 . 1 Bindincr In order to 
analyse the most striking influences of certain residues on 
A2.1 binding, a threshold level was set for the ratios 
described in Taile 7. Residues showing a more than 4- fold 
greater frequency in good binders were regarded as preferred 
residues ( + ) . Residues showing a 4- fold lower frequency in 
A2.1 binders than in non-binders were regarded as disfavored 
residues {-). Following this approach, residues showing the 
most prominent positive or negative effects on binding are 
listed in Table 8. 
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This cable identifies the ammo acid groups which 
influence binding most significantly in each of the non- anchor 
positions. In general, the most negative effects were observed 
with charged amino acids. In position 1, negatively, charaed 
5 amino acids were not obsezrved in good binders, i.e., those 

amino acids were negative binding residues at position i. The 
opposite was true for position o /nere only basic amino acids 
were detrimental for binding i.e., were negative binding 
residues. Moreover, both acidic and basic amino acids were not 
10 observed in A2 . 1 binders in positions 3 and 7. A greater than 
4- fold increased frequency of non- binders was found when P was 
in position 1. 

TABLE 8 

Summary of A2 . 1 Motif -Library, 9-mers 



25 



AA POSITION 


( + ) 


(-) 


1 


(YFW) 


P, (DE) 


2 


Anchor 




3 


(YFW) 


(DE) , (RKH) 


4 


(STC) 




5 


(YFW) 




6 




(RKH) 


7 


A 


(RKH) , (DE) 


8 






9 


Anchor 





(+) = Ratio a 4- fold (-) = Ratio s 0.25 



30 Aromatic residues were in general favored in several of 

the non- anchor positions, particularly in positions 1, 3, and 
5. Small residues like S, T, and C were favored in position 4 
and A was favored in position 7. 

An Improved A2 . 1 9-mer Mnr-if The data described above 

35 was used to derive a stringent A2.1 motif. This motif is 
based in significant part on the effects of the non- anchor 
positions 1 and 3-8. The uneven distribution of amino acids at 
different positions is reflective of specific dominant 
negative binding effects of certain residues, mainly charged 
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ones, on ninding affinity. A series cf rules were Ger:.ved ro 
idencify appropriace anchor residues in positions 2 and 9 and 
negative binding residues ac posinions 1 and 3-8 to enable 
selection of a high affinity binding immunogenic peptide. 
5 These rules are summarized in Table 9 . 

To validate the motif defined above and shown in Table 9 
published sequences of peptides that have been naturally 
processed and presented by A2 . 1 molecules were analysed (Table 
10). Only 9-mer peptides containing the 2-9 anchor residues 

1 0 were cons idered . 

When the frequencies of these peptides were analysed, it 
was found that in general they followed the rules summarized 
in Table 9. More specifically, neither acidic amino acids nor 
P were found in position l. Only one acidic amino acid and no 

15 basic amino acids were found in position 3, Positions 6 and 7 
showed no charged residues. Acidic amino acids, however, were 
frequently found in position 8, where they are tolerated, 
according to our definition of the A2.1 motif. The analysis of 
the sequences of naturally processed peptides therefore 

20 reveals that >90% of the peptides followed the defined rules 
for a complete motif. 

Thus the data confirms a role of positions other than the 
anchor positions 2 and 9 for A2.1 binding. Most of the 
deleterious effects on binding are induced by charged amino 

25 acids in non-anchor positions, i.e. negative binding residues 
occupying positions 1, 3, 6 or 7. 
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TABLE 9 

A2,l MOTIF FOR 9-MER FHFTIDES 



10 



AA Posxcicn 


( + ) 


(-) 


i;.. 




acidic : : :amino acida ;ai^ P \ • 




: Anchor: I*; . (t, V.,.A,T>; 




■3v;. 




;acidlc: and: basic:: antiM^a nA rfg : 


4 






5 










hasic. vamino-aclds : 






jiiiacidicc: aiKi:::::baBic ::::amiiio:^a«idss^^ 


8 
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TABLE 10 

A2 . 1 nacuraiiy processed peptides 





2 


3 


4 


— 


o 


7 1 


s 


Q 


A2 . 1 binding 


A 


ij 


X 


G 


G 


X 


V 


N 


V 


ND 


L 




D 


V 


? 


T 


A 


A 


V 


ND 


G 


X 


V 


P 


F 


X 


V 


S 


V 


0.41 


s 


L 


L 


P 


A 


I 


V 


E 


L 


0.19 


s 


X 


X 


V 


R 


A 


X 


E 


V 


ND 


Y 


M 


N 


G 


T 


M 


s 


Q 


V 


ND 


K 


X 


N 


E 


P 


V 


X 


X 


X 


ND 


Y 


L 


L 


P 


A 


I 


V 


H 


I 


0.26 


A 


X 


W 


G 


F 


F 


p 


V 


X 


ND 


T 


L 


w 


V 


D 


P 


Y 


E 


V 


0,23 


G 


X 


V 


P 


F 


X 


V 


S 


V 


0.41 



Binding of Peptide lO-mers 

The "Motif -Library" Approach Previous data clearly 
indicated that 10-mers can also bind to HLA molecules even if 
with a somewhat lower affinity than 9-mers. For this reason we 
expanded our analysis to 10-mer peptides. 

Therefore, a "Motif -Library" set of 170 peptide 10- 
mers containing optimal motif - combinations was selected from 
known target molecule sequences of viral ajid tumor origin and 
analysed as described above for 9-mers. In this set we found 
5.9% good binders, 17.1% intermediate binders, 41.2% weak 
binders and 35.9% non-binders. The actual sequences, origin 
and binding capacities of this set of peptides are included as 
Appendix 2. This set of 10-mers was used to determine a) the 
rules for 10-mer peptide binding to A2.1, b) the similarities 
or differences to rules defined for 9-.mers, and c) if an 
insertion point can be identified that would allow for a 
superimposa±Dle common motif for 9-mers and 10-mers, 



wo 94/20127 PCT/US94/02353 

52 

Amino-acid frequencies and frequency racios fcr the 
various amino-acid groups for each position were generated for 
10-mer peptides as described above for 9-rner peptides and are 
also shown in Tables 11 and 12, respectively for grouped 
residues . 

A summary of preferred versus disfavored . residues 
and of the rules derived for the 10-mers in a manner analogous 
to that used for 9-mers, is also listed in Tables 13 and 14, 
respectively. 

When the frequency- ratios of different amino-acid 
groups in binders and non -binders at different positions were 
analysed and compared to the corresponding ratios for the 9- 
mers, both striking similarities and significant differences 
emergetr— (Table 15). At the N- terminus and the C- termini of 9- 
15 mers "and' 10-mers, similarities predominate. In position 1 for 
example, in 10-mers again the P residue and acidic amino acids 
were not tolerated. In addition at position 1 in 10-mers 
aromatic residues were frequently observed in A2.1 binders. In 
position 3, acidic amino acids were frequently associated with 
poor binding capacity in both 9 -mers and 10-mers. 
Interestingly, however, while in position 3 aromatic residues 
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TABLE 13 

Summary of A2 . 1 Motif - Library 10-mers 





{ + ) 




X 


( VTrw\ 2X 

\ X ir Inf ; , 2-1 




4, 






'X 






4 






c 

3 




IT 


g 


G 




7 




(RKH) 
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(YFW) , (LVIM) 


(DE) , (RKH) 


9 




(RKH) 


10 


Anchor 





(+) = Ratio a 4- fold 



(-) = Ratio s 0.25 
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TABLE 14 
A2.1 MOTIF FOR 10-MER PEPTIDES 



AA 
Position 


(+) 


(-) 






acid Ic. • aminQ * i n rf p 


















■ i»sic;::ainxiia:-ratc 
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..7 . 




hajsxc:; amino- acids 
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; Anchor;-.-" V, ^X^:. lil-^AVH) 
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TABLE 15 

COMPARISON OF A2.1 BINDING OF 9-MERS AND 10-MERS 

9-mers 10-mers 



AA Position [ (+) 








2 
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(YWF> 


(LVTM) 


4 


(STC) 
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(YWF) 




6 
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7 






8 






9 






10 


- 




AA Position 


9-mers 

(.) 


10-mers 

1 ^ > 


1 






2 






3 




: • '•' ' • '= ■ i-H^^?^^^^ v;^ "; ; " l '?:.:::^^. : ■ • • I • 
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(RKH) 


10 
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were preferred in 9-mers, aliphatic resiLdues (L, V, m) were 
preferred in lO-mers. 

At the C-teminus of the peptides, basic amino acids 
are not favored in position 7, and both acidic and basic aniino 
acids are not favored in position 8 for 10-mers, This is in 
striking agreement with the observation that the same pattern 
was found in 9-mers at positions 6 and 7. Interestingly, again 
the favored residues differ between two peptides sizes. 
Aromatic (Y, F, W) or aliphatic (L, V, I, M) . residues were 
preferred in 10-mers at position 8, while the A residue was 
preferred by 9-mers in the corresponding position 7. 

By contrast, in the center of the peptide no 
similarities of frequency preferences were observed at 
positions 4, 5, and 6 in 10-mers and positions 4 and 5 in the 
9-mers . 

Most interestingly, among the residues most favored 
in the center of the tested peptides were G in position 4 and 
6, P in position 5 was not observed in binders. All of these 
residues are known to dramatically influence the overall 
secondary structure of peptides, and in particular would be 
predicted to strongly influence the propensity of a 10-mer to 
adopt a "kinked" or "bulged" conformation. 

Charged residues are predominantly deleterious for 
binding and are frequently observed in non-binders of 9-mers 
and 10-mers. 

However, favored residues are different for 9-mers 
and 10-mers. Glycine is favored while Proline is disfavored 
in the center of 10-mer peptides but this is not the case for 
9-mers. 

These data establish the existence of an "insertion 
area" spanning two positions (4, 5) in 9-mers and 3 positions 
(4, 5, S) in 10-mers. This insertion area is a more 
permissive region where few residue similarities are observed 
between the 9-mer and 10-mer antigenic peptides. Furthermore, 
in addition to the highly conserved anchor positions 2 and 9, 
there are "anchor areas" for unfavored residues in positions i 
and 3 at the N- terminus for both 9-mer and 10-mer and 



wo 94/20n7 PCTyTJS94/02353 

59 

positions 7-i0 or 6-9 at: the C- terminus for 10-mers and 9- 
mers , respectively . 

Example 6 

5 Algorithm to Predict Binding of 9-mer Peptides to HIiA-A2.1 

Within the population of potential A2 . 1 binding 
peptides identified by the 2,9 motif, as shown in the previous 
example, only a few peptides are actually good or intermediate 
binders and thus potentially immunogenic. It is apparent from 

10 the data previously described that the residues present in 

positions other than 2 and 9 can influence, often profoundly, 
the binding affinity of a peptide. For example, acidic 
residues at position 1 for A2 . 1 peptides do not appear to be 
tolerated. .Therefore, a more exact predictor of binding could 

15 be generated by taking into account the effects of different 
residues at each position of a peptide sequence, in addition 
to positions 2 and 9, 

More specifically, we have utilized the data bank 
obtained during the screening of our collection of A2.1 motif 

20 containing 9-mer peptides to develop an algorithm which 

assigns a score for each amino acid, at each position along a 
peptide. The score for each residue is taken as the ratio of 
the frequency of that residue in good and intermediate binders 
to the frequency of occurrence of that residue in non-binders. 

25 In the present "Grouped Ratio'' algorithm residues 

have been grouped by similarity. This avoids the problem 
encountered with some rare residues, such as tryptophan, where 
there are too few occurrences to obtain a statistically 
significant ratio. Table 16 is a listing of scores obtained 

30 by grouping for each of the twenty amino acids by position for 
9-mer peptides containing perfect 2/9 motifs. A peptide is 
scored in the "Grouped Ratio" algorithm as a product of the 
scores of each of its residues. In the case of positions 
other than 2 and 9 , the scores have been derived using a set 

35 of peptides which contain only preferred residues in positions 
2 and 9. To enstble us to extend our "Grouped Ratio" algorithm 
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TABLE 16 





1 


2 


3 


4 


5 


6 


7 


8 


9 






















A 


2.6 


0.03 


0.87 


0.87 


0.65 


0.87 


4.4 


0,29 


0 . 16 


C 


0 . 73 


0.01 


1.9 


4.8 


0.87 


1.2 


.1.2 


1.1 


0 . 01 


D 


0.10 


0.01 


0.10 


0.65 


0.29 


0.65 


0.11 


0.87 


0 . 01 


E 


0.10 


0.01 


0.10 


0,65 


0.29 


0.65 


0.11 


0.87 


0 . 01 


F 


7.0 


0.01 


5.2 


0.87 


8.7 


^2.0 


2.3 


2.6 


0 . 01 


G 


3.5 


0.01 


0.44 


1.1 


1.1 


1.3 


0.44 


0. 44 


0 . 01 


H 


3.1 


0.01 


0.22 


1.0 


0.87 


0.09 


0.10 


1.3 


0 . 01 


I 


3.1 


0.14 


1.8 


0,55 


0.87 


1.4 


1.2 


1. 8 


0 . 40 


K 


3.1 


0.01 


0.22 


1.0 


0.87 


0 . 09 


0 . 10 


1.3 


0 . 01 


L 


3.1 


1.00 


1.8 


0,55 


0.87 


1.4 


1.2 


1.8 


0 . 09 


M 


3.1 


2.00 


1.8 


0,55 


0 . 87 


1.4 


1.2 


1.8 


0 . 06 


N 


0.50 


0.01 


0.37 


1.2 


0.87 


1.1 . 


0 . 65 


0 .33 


0 . 01 


P 


0.12 


0.01 


0.70 


0,73 


2.6 


1-8 


2.9 


0.10 


0.01 


Q 


0.50 


0.01 


0.37 


1,2 


0.87 


1.1 


0.65 


0.33 


0.01 


R 


3.1 


0.01 


0.22 


1.0 


0.87 


0.09 


0.10 


1.3 


0.01 


S 


0.73 


0.01 


1.9 


4.8 


0.87 


1.2 


1.2 


1.1 


0,01 


T 


0.73 


0,01 


1.9 


4.8 


0.87 


1.2 


1.2 


1.1 


0,01 


V 


3.1 


0.08 


1.8 


0.55 


0.87 


1.4 


1.2 


1.8 


1.00 


w 


7.0 


0,01 


5.2 


0.87 


8.7 


2.0 


2.3 


2.6 


0.01 


Y 


7.0 


0.01 


5.2 


0.87 


8.7 


2.0 


2.3 


2.6 


0.01 



wo 94/20127 



PCT/US94/023S3 



61 

to peptides which may have residues other rhan the preferred 
ones at 2 and 9, scores for 2 and 9 have been derived from a 
set of peptides which are single amino acid substitutions at 
positions 2 auad 9, Figure 2 shows a scattergram of the log of 
5 relative binding plotted against "Grouped Ratio" algorithm 

score for our collection of 9-mer peptides from the previous 
■ example. 

The present "Grouped Ratio" algorithm can be used to 
predict a population of peptides with the highest occurrence 

10 of good binders. If one were to rely, for example, solely on 
a 2(L,M) and 9 (V) motif for predicting A2 . 1 binding 9-mer 
peptides, it would have been predicted that all 160 peptides 
in our database would be good binders. In fact, as has 
already been described, only 12% of these peptides would be 

15 described as good binders and only 22% as intermediate 

binders; 66% of the peptides predicted by such a 2,9 motif are 
either weak or non-binding peptides. In contrast, using the 
"Grouped Ratio" algorithm described above, and selecting a 
score of 1.0 as threshold, 41 peptides were selected. Of this 

20 set, 27% are good binders, and 49% are intermediate, while 
only 20% are weak and 5% are non-binders (Table 17) . 

The present example of an algorithm has used the' 
ratio of binders /non- binders to measure the impact of a 
particular residue at each position of a peptide. It is 

25 immediately apparent to one of ordinary skill that there are 
alternative ways of creating a similar algorithm. 

An algorithm using the average binding affinity of 
all the peptides with a certain amino acid (or amino acid 
type) at a certain position has the advantage of including all 

30 of the peptides in the analysis, auad not just 

good/ intermediate binders and non-binders. Moreover, it gives 
a more quantitative measure of affinity than the simpler 
"Grouped Ratio" algorithm. We have created such an algorithm 
by calculating for each amino acid, by position, the average 

35 log of binding when that particular residue occurs in our set 
of 160 2,9 motif containing peptides. These values are shown 
in Table 18. The algorithm score for a peptide is then taken 
as the sum of the scores by position for each residues. 
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Figure 5 shows a scattsrgram of the log of relative binding 
against the average "Log of Binding" algorithm score. Table 
17 shows the ability of the two algorithms to predict peptide 
binding at various levels, as a function of the cut-off score 
5 used. The ability of a 2,9 motif to predict binding in the 

same peptide set is also shown for reference purposes. it is 
clear from this comparison that both algorithms of this 
invention have a greater ability to predict populations with 
higher frequencies of good binders than a 2,9- motif alone. 

10 Differences between, the "Grouped Ratio" algorithm and the "Log 
of Binding" algorithm are small in the set of peptides 
analyzed here, but do suggest that the "Log of Binding" 
algorithm is a better, if only slightly, predictor than the 
"Grouped Ratio" algorithm. 

15 The log of binding algorithm was further revised in 

two ways. First, poly- alanine (poly- A) data were incorporated 
into the algorithms at the anchor positions for residues 
included in the expanded motifs where data obtained by 
screening a large library of peptides were not available. 

20 Second, am "anchor requirement screening filter" was 

incorporated into the algorithm. The poly -A approach is 
described in detail, above. The "anchor requirement screening 
filter" refers to the way in which residues are scored at the 
anchor positions, thereby providing the ability to screen out 

25 peptides which do not have preferred or tolerated residues in 
the anchor positions. This is accomplished by assigning a 
score for unacceptable residues at the anchor positions which 
are so high as to preclude any peptide which contains them 
from achieving an overall score which would allow it to be 

3 0 considered as a potential binder. 

The results for 9-mers and 10-mers are presented in 
Tables 26 cuid 27, below. In these tables, values are group 
values as follows: A; G; P; D,E; R,H,K; L,I,V,M; F,Y,W; S,T,C; 
and Q,N, except where noted in the tables. 
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TABLE 18 





1 


2 


3 


4 


5 




7 


8 


9 






















A 


•2.38 


-3 .22 


-2.80 


-2.68 


-2.89 -2.70 


-2.35 


-3 .07 


-2 .49 


C 


-2.94 


-4 .00 


-2.58 


-1.96 . 


-3 .29 


-2 ,22 


-2 .97 


-2.37 


-4 .00 


D 


-3 .69 


-4 .00 


-3 .46 


-2.71 


-2.26 


-2.63 


-3 .61 


-3.03 


-4 .00 


E 


-3.64 


-4 .00 


-3 .51 


-2.65 


-3.39 


-3 .41 


-3.21 


-2.63 


-4 .00 


F 


-1.89 


-4.00 


-2.35 


-2.50 


-1.34 


-2 .43 


-2.18 


-1.71 


-4 .00 


G 


-2.32 


-4 .00 


-3 .04 


-2.63 


-2.56 


-2.30 


-3.13 


-2.96 


-4.00 


H 


-2,67 


-4 .00 


-2.58 


-2.58 


-2.05 


-3 .32 


-3 .13 


-2.16 


-4 .00 


I 


-1.65 


-2.55 


-2.80 


-3 .44 


-2.74 


-2 .79 


-2.20 


-2.69 


-2.10 


K 




-4 .00 


-3.65 


-2.93 


-3.34 


-3 .77 


-3.13 


-3.27 


-4 .00 


L -2 .32 


-1.70 


-2.02 


-2.49 


-2.71 


-2 .63 


-2 .62 


-2.01 


-2 .74 


M 


-'•9 


-1.39 


-1.79 


-3.07 


-3.43 


-1.38 


-1.33 


-0.97 


-2.96 


N 


-3 .12 


-4 .00 


-3 .52 


-2.22 


-2.36 


-2.30 


-3 .14 


-3.31 


-4 .00 


P 


-3 .61 


-4 .00 


-2.97 


-2.64 


-2.42 


-2 .31 


-1.83 


-2.42 


-4.00 


Q 


-2 .76 


-4 .00 


-2.81 


-2.63 


-3.06 


-2 .84 


-2.12 


-3.05 


-4.00 


R 


-1.92 


-4 .00 


-3.41 


-2.61 


-3,05 


-3 .76 


-3 .43 


-3.02 


-4.00 


S 


-2.39 


-3 .52 


-2.04 


-2.12 


-2.83 


-3 .04 


-2.73 


-2.02 


-4.00 


T 


-2.92 


-4 .00 


-2.60 


-2.48 


-2.17 


-2 .58 


-2.67 


-3 .14 


-3.70 


V 


-2.44 


-2.64 


-2.68 


-3.29 


-2 .49 


-2.24 


•2.68 


-2.83 


-1.70 


W 


-0.14 


-4 .00 


-1.01 


-2.94 


-1.63 


-2 .77 


-2.85 


-2.13 


-4 .00 


X 


-1.99 


-2.13 


-2.41 


-2.97 


-2.72 


-2 .70 


-2 .41 


-2.35 


-2.42 


Y 


-1.46 


-4 .00 


-1.67 


-2.70 


-1.92 


-2 .39 


-1.35 


-3.37 


-4 .00 
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Examisle 7 

Use of an Alaoritihin to Predict Bin ding of lO-mer Pepcides to 

HIA-A2.1 

Using the methods described in the proceeding 
example, an analogous set of algorithms has been developed for 
predicting the binding of lO-mer peptides. Ta±3le 19 shows the 
scores used in a "Grouped" Ratio" algorithm, and Table 20 shows 
the "Log of Binding" algorithm scores, for 10-mer peptides. 
Table 21 shows a comparison of the application of the two 
different algorithmic methods for selecting binding peptides. 
Figures 4 and 5 show, respectively, scattergrams of a set of 
10-mer peptides containing preferred residues in positions 2 
ajid 10 as scored by the "Grouped Ratio" and "Log of Binding" 
algorithms . 
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TABLE 19 





1 


2 


3 


4 


5 


6 


7 


8 


9 


10 
























A 


3.00 


0.01 


3 .10 


0.20 


1.60 


0.60 . 


1.30 


1.60 


6.50 


0.01 


C 


0.90 


0.01 


0.90 


1.10 


r.oo 


0.90 


1.40 


1.30 


2.90 


0.01 


D 


0.01 


0.01 


0.20 


0.60 


0.30 


1.00 


0.30 


0.01 


0 .40 


0.01 


E 


0.01 


0.01 


0.20 


0.60 


0.30 


1.00 


0.30 


0.01 


0.40 


0.01 


F 


3.00 


0.01 


2.60 


3.10 


3.60 


0.60 


1.60 


14 .1 


2.10 


0.01 


G 


0.80 


0.01 


0.50 


4.70 


0.80 


6.30 


2.70 


0.70 


0.80 


0.01 


H 


1.20 


0.01 


0.30 


0.10 


0.70 


0.40 


0.20 


0.01 


0.20 


0.01 


I 


3.00 


0,50 


10.2 


1.00 


1.30 


2.10 


1.40 


4 .70 


0.80 


1.00 


K 
L 


1.20 
3.00 


0.01 
1.10 


0.30 
10.2 


0.10 
1.00 


0.70 
1.30 


0.40 
2.10 


0.20 
1.40 


0.01 
4 .70 


0.20 
0.80 


0.01 
0.50 


M 


3.00 


0.60 


10.2 


1.00 


1.30 


2.10 


1.40 


4 .70 


0.80 


0.01 


N 


1.00 


0.01 


0.90 


0.80 


0.80 


0.80 


0.60 


0.40 


0.70 


0.01 


P 


0.00 


0.01 


0.40 


2.60 


0.01 


1.00 


0.40 


1.90 


1.20 


0.01 


Q 


1 . 00 


0 . 01 


0 .90 


0 . 80 


0.80 


0.80 


0 . 60 


0.40 


0.70 


0.01 


R 


1.20 


0.01 


0.30 


0.10 


0.70 


0.40 


0.20 


0.01 


0.20 


o.or 


S 


0.90 


0.01 


0.90 


1.10 


1.00 


0.90 


1.40 


1.30 


2.90 


0.01 


T 


0.90 


0.01 


0.90 


1.10 


1.00 


0.90 


1.40 


1.30 


2.90 


0.01 


V 


3.00 


0.10 


10.2 


1.00 


1.30 


2.10 


1.40 


4.70 


0.80 


2.30 


W 


3.00 


0.01 


2.60 


3 .10 


3.60 


0.60 


1.60 


14 .1 


2.10 


0.01 


Y 


3.00 


0.01 


2.60 


3 .10 


3.60 


0.60 


1.60 


14 .1 


2.10 


0.01 
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Example S 

Binding of A2 . 1 Alaorithin Predicted Pepcides 

The results of Examples 6 and 7 indicate that an algorithm can 
be used to select peptides that bind to HIiA-A2 . 1 sufficiently 
to have a high probability of being immunogenic. 
To test this result, we tested our algorithm on a large (over 
1300) non- redundant , independent set of peptides derived from 
various sources. After scoring this set with our algorithm, 
we selected 41 peptides (Table 21) for synthesis, and tested 
them for A2.1 binding. This set of peptides was comprised of 
21 peptides with high algorithm scores, and 20 peptides with 
low algorithm scores . 

The binding data and categorization profile are shown in 
Tables 22 and 23 respectively. The correlation between 
binding and algorithm score was 0.69. It is immediately 
apparent from Table 23 the striking difference between 
peptides with high algorithm scores, suad those with low 
algorithm scores. Respectively, 76% of the high scorers and 
none of the low scorers were either good or intermediate 
binders* This data demonstrates the utility of the algorithm 
of this invention. 



wo 94/20127 



PCT/US94/02353 



TABLE 22 

A2 . 1 Algorithm 



SEQUENCE 


SOURCE 


Binding 


Score 


i jnn r V V 4j X V 


CMV 


u . / o 


346 


VT.T.T.VPQPV 


uav 


n T c 


312 


VT.VPT.WTTPT. 




n 7? 


169 


CMIiTI'V I .VI' I . 
c sin X X ^ V X 




C Q 

u . o e 


336 


T.T.tJWTTTT.T. 


. V 


n A Q 


356 


GT.WCVLFFV 




\j ,*± / 


1989 


' ' Xv w V XjA V 




n AC 


296 


T.T.T.r^T.PgT. 


^X3V 


U • 4^ 


1356 


PT.T.TgT.PFT. 


HSV 


U • ^ 4 


859 


T.TT.WnV^T. 


HSV 


n o 0 
u . 


390 






ft 1 c 


1746 


xWUjXaw va V 




rt ft Q O 


411 


X^^S^lrx^ V xrxu V 




ft ft A O 


392 


V ^nAX f X V 


CMV 


ft ft A. ^ 


196 


PT.T.PffTTPT. 


V 


n ft 1 "7 


494 


WMWTTP Q VT .T . 


Uuv 


ft ft ^ c 
U . 0 J 6 


362 


X ^WWUX X llAJ 




ft ft •? T 


1043 


WK^UUA V V f V 


no V 


ft ftT T 


1422 


nr/SPYOOIiV 




ft ft n Q Q 


184 


TAfppPVAEti 

A4i 1 r tr F V.WRXJ 


V 


ft ftftCC 


416 


T.MnPV^rPPT. 
lUnwAx X f fuj 




ft ftftCC 


414 


x^xjx nfu^x nxj 


V 


ft ft AC ^ 


250 


aMppPVLHV 


V 


ft ft ft ^ A 


628 


ALIiLVLALL 


^x^v 


ft ft m A 


535 


#VXXt^ W A Vrf X A Ad 


CMV 


ft ftftft C 


602 


MLNVMKEAV 


CMV 


ft ftftO Q 


0 .00031 


TMEIMTRTV 

X XXB IM » t <v X V 


V Jl V 


ft ft ft O Q 


0.0013 




HSV 


ft ft ft ft Q 


0 . 0019 




CMV 


ft ft Oft c 


0 . 00021 


EL2IFSERli 


HSV 


ft ft Aft "5 

U . 0U02 


0 . 0020 


FLRVQQKAL 


HSV 


ft ft Aft ? 


0 .00099 


ELQMMQDWV 


CKV 


ft ft ft m 


0 . 0020 


QLNAMKPDL 


MT 


ft 0001 


0 . 0017 


GLRQLKGAL 


CMV 


0.0001 


0.0010 


TLRMSSKAV 


HSV 


0.0001 


0.00085 


SLRIKRSLL 


CMV 


0 


0.00041 


DLKQMERW 


CMV 


0 


0.00026 


PLRVTPSDL 


CMV 


0 


0.0019 


QLDYEKQVL 


CMV 


0 


0.0012 


WLKLLRDAL 


CMV 


0 


0.0012 


PMEAVRHPL 


CMV 


0 


0.0011 


ELKQTRVNL 


CMV 


0 


0.00053 


NLEVIHDAL 


CMV 


0 


0.00050 


KLKKVKSVL 


HSV 


0 


0.00033 


PLAYERDKL 


CMV 


0 


0.00017 
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Example 9 

Ex vivo induction of Cytotoxic T Lymphocytes fCTT.) 

Peripheral blood mononuclear cells (PBMC) are 
isolated from an HLA- typed patient by either venipuncture or 
apheresis (depending upon the initial amount of CTLp 
required) , and purified by gradient centrifugarion using 
Ficoll-Paque (Pha~_-cia) . Typically, one can obtain one 
million PBMC for every ml of peripheral blood, or 
alternatively, a typical apheresis procedure can yield up to a 
total of 1-10 X 10^° PBMC. 

The isolated and purified PBMC are co- cultured with 
an appropriate number of antigen presenting cell (APC) , 
previously incubated ("pulsed") with an appropriate amount of 
synthetic peptide (containing the HLA binding motif and the 
sequence of the antigen in question) . PBMC are usually 
incubated at 1-2 X 10^ cells/ml in culture medium such as 
RPMI-1640 (with autologous senam or plasma) or the serum- free 
medium AIM-V (Gibco) . 

APC are usually used at concentrations ranging from 
1^ to 2X10^ cells/ml, depending on the type of cell used. 
Possible sources of APC include: 1) autologous dendritic cells 
(DC) , which are isolated from PBMC and purified as described 
(Inaba, et al., J. Exp. Med. 166:182 (1987)); and 2) mutant 
and genetically engineered mammalian cells that express 
"empty" HLA molecules (which are syngeneic [genetically 
identical] to the patient's allelic HLA form), such as the, 
mouse RMA-S cell line or the human T2 cell line. APC 
containing empty HLA molecules are known to be potent inducers 
of CTL responses, possibly because the peptide can associate 
more readily with empty MHC molecules than with MHC molecules 
which are occupied by other peptides (DeBruijn, et al., Eur. 
J. Immunol . 21:2963-2970 (1991)). 

In those cases when the APC used are not autologous, 
the cells will have to be gamma irradiated with an appropriate 
dose (using, e.g., radioactive cesixim or cobalt) to prevent 
their proliferation both ex vivo . "and when the cells are 
re- introduced into the patient^. 
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The mixture cultures, containing PBMC, APC and 
peptide are kept in an appropriate culture vessel such as 
plastic T- flasks, gas -permeable plastic bags, or roller 
bottles, at 37° centigrade in a humid air/C02 incubator. 
After the activation phase of the culture, which usually 
occurs during the first 3-5 days, the resulting effector CTL 
can be further expanded, by the addition of recombinant 
DNA-derived growth factors such as interleukin-2 {IL-2) , 
interleukin-4 (IL-4) , or interleukin-7 {IL-7) to the cultures. 
An expansion culture can be kept for an additional 5 to 12 
days, depending on the numbers of effector CTL required for a 
particular patient. In addition, expansion cultures may be 
performed using hollow fiber artificial capillary systems 
(Cellco) , where larger numbers of cells (up to 1X10^^) can be 
maintained. 

Before the cells are infused into the patient, they 
are tested for activity, viability, toxicity and sterility. 
The cytotoxic activity of the resulting CTL caji be determined 
by a standard ^^Cr- release assay (Biddison, W.E. 1991, Current 
Protocols in Immunology, p7, 17 . 1-7 . 17 . 5 , Ed. J. Coligan et 
al., J. Wiley and Sons, New York), using target cells that 
express the appropriate HLA molecule, in the presence and 
cdDsence of the immunogenic peptide. ViaJbility is determined 
by the exclusion of trypaji blue dye by live cells. Cells are 
tested for the presence of endotoxin by conventional 
techniques. Finally, the presence of bacterial or fungal 
contamination is determined by appropriate microbiological 
methods (chocolate agar, etc.). Once the cells pass all 
quality control and safety tests, they are washed and placed 
in the appropriate infusion solution (Ringer/glucose lactate) 
and infused intravenously into the patient. 

Example 10 
Assays for CTL Activity 
1, Peptide synthesis : Peptide syntheses were carried 

out by sequential coupling of "N-a-Fmoc -protected amino acids 
on an Applied Biosystems (Foster City, CA) 43 OA peptide 
synthesizer using standard Fmoc coupling cycles (software 
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version 1.40). All amino acids, reagents, and resins were 
obtained from Applied Biosystems or Bachem. Solvents were 
obtained from Burdick & Jackson. Solid- phase synthesis was 
started from an appropriately substituted Fmoc- amino acid- 
5 Sasrin resin. The loading of the starting resin was 0.5-0.7 
mmol/g polystyrene, and 0.1 or 0.25 meq were used in each 
synthesis. A typical reaction cycle proceeded as follows: 1) 
The N- terminal Fmoc group was removed with 25% piperidine in 
dimethyl formamide (DMF) for 5 minutes, followed by another 

10 treatment with 25% piperdine in DMF for 15 minutes. The resin 
was washed 5 times with DMF. An N-methylpyrolidone (NMP) 
solution of a 4 to 10 fold excess of a pre- formed l- 
hydroxybenzotriazole ester of the appropriate Fmoc -amino acid 
was added to the resin and the mixture was allowed to react 

-5 for 30-90 min. The resin was washed with DMF in preparation 
for the next elongation cycle. The fully protected, resin 
bound peptide was subjected to a piperidine cycle to remove 
the terminal Fmoc group. The product was washed with 
dichl or ome thane and dried. The resin was then treated with 

20 trif luoroacetic acid in the presence of appropriate scavengers 
[e.g. 5% (v/v) water] for 60 minutes at 20*C. After 
evaporation of excess trif luoroacetic acid, the crude peptide 
was washed with dimethyl ether, dissolved in water and 
lyophilized. The peptides wee purified to >95% homogeneity by 

25 reverse-phase HPLC using H2O/CH3CN gradients containing 0.2% 
TFA modifier on a Vydac, 300A pore- size, C-18 preparative 
column. The purity of the synthetic peptides was assayed on 
an analytical reverse-phase column, and their composition 
ascertained by amino acid analysis and/or sequencing. 

30 Peptides were routinely dissolved in DMSO at the concentration 
of 20 mg/ml. 

2. Media: RPMI-1640 containing 10% fetal calf serum 

(FCS) 2 mM Glutamine, 50 /xg/ml Gentamicin and 5x10 '^M 2- 
mercaptoethanol served as culture medium and will be referred 
35 to as RIO medium. 

RPMI-1640 containing 25 mM Hepes buffer and 
supplemented with 2% FCS was used as cell washing medium. 
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3^ Rat Concariavalin A supernatanc : The spleen cells 

obtained from Lewis rats (Sprague-Dawley ) were resuspended at 
a concentration of 5x10^ cells/na in RIO medium supplemented 
with 5 fig/ml of ConA in 75 cm2 tissue culture flasks. After 
48 hr at 37 *C, the supernatants were collected, supplemented 
with 1% q; -methyl -D-mannoside and filter sterilized ,(.45 ^un 
filter). Aliquots were stored frozen at -20 'C. 

4. LPS -activated lymphoblasts : Murine splenocytes were 
resuspended at a concentration of l-1.5xl0^/ml in RlO medium 
supplemented with 25 fis/rcLl LPS and 7 fig/vcil dextran sulfate in 
75 cm^ tissue culture flasks. After 72 hours at 37 'C, the 
lymphoblasts were collected for use by centrif ugation. 

5. Peptide coating of lymphoblasts : Coating of the LPS 
activated lymphoblasts was achieved by incubating 3 0x10^ 
lymphoblasts with 100 fig of peptide in 1 ml of RIO medium for 
1 hr at 37* C. Cells were then washed once and resuspended in 
RIO mediiom at the desired concentration for use in in vitro 
CTL activation. 

6 . Peptide coating of Jurkat A2/K^ cells : Pep t ide 
coating was achieved by incxabating 10x10^ irradiated (20,000 
rads) Jurkat A2.l/K^ cells with 20 fig of peptide in 1 ml of 
RIO medium for 1 hour at 37 'C. Cells were washed three times 
and resuspended at the required concentration in RIO medium. 

7. In Vitro CTL activation : One to four weeks after 
priming spleen cells (5 x 10^ cells/well or 30x10^ cells/T25 
flask) were concultured at 37 'C with syngeneic, irradiated 
(3,000 rads) , peptide coated lymphoblasts (2x10^ cells/well or 
10x10^ cells/T25 flask) in RIO medium to give a final volume 
of 2 ml in 24 -well plates or 10 ml in T25 flasks. 

8. Restimulation of effector cells : Seven to ten days 
after the initial in vitro activation, described in paragraph 
7 above, a portion of the effector cells were restimulated 
with irradiated (20,000 rads), peptide- coated Jurkat A2/K^ 
cells (0.2x10^ cells/well) in the presence of 3x10^ "feeder 
cells "/well {C57B1/6 irradiated spleen cells) in RIO medium 
supplemented with 5% rat ConA supernatant to help provide all 
of the cytokines needed for optimal effector cell growth. 
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9. Assay for cvtocoxic activity : Target ceils (3x10°) 

were incubated at 37 'C in the presence of 200 /xl of sodium 
^^Cr chromate. After 60 minutes, cells were washed three 
times and resuspended in RIO medium. Peptides were added at 
5 the required concentration. For the assay, 10^ ^^Cr- labeled 

target cells wee added to different concentrations of effector 
cells (final volume of 200 /il) in U-bottom 96-2311 plates. 
After a 6 -hour incubation peric ..t 37*C, 0.1 ml aliquot s of 
supernatant were removed from eacr. well and radioactivity was 

10 detemined in a Micrcmedic automatic gainma counter. The 

percent specific lysis was determined by the forrnula: percent 
specific release = 100x( experimental release - spontaneous 
release) / (maximum release - spontaneous release). Where 
peptide titrations wee performed, the antigenicity of a given 

15 peptide (for comparison purposes) was expressed as the peptide 
concentration required to induce 40% specific ^^Cr release at 
a given E:T. 

Transgenic mice were injected subcutaneously in the 
base of the tail with an incomplete Freund's adjuvant emulsion 

20 containing 50 nM of the putative CTL epitopes containing the 

A2.1 motifs, and 50 nM of a hepatitis B core T helper epitope. 
Eight to 20 days later, animals were sacrificed and spleen 
.^lls were restimulated in vitro with syngeneic LPS 
lymphoblasts coated with the putative CTL epitope. A source 

25 of IL-2 (rat con A supernatant) was added at day 6 of the 
assay to a final concentration of 5% and" CTL activity was 
measured on day 7, The capacity of these effector T cells to 
lyse peptide -coated target cells that express the A2 KB 
molecule (Jurkat A2 KB) was measured as lytic units. The 

30 results are presented in T::J3le 24, 

The results of -his experiment indicate .that those 
peptides having a binding of at least 0.01 are capable of 
inducing CTL. All of the peptides in Appendices 1 and 2 
having a binding of at least about 0.01 would be immunogenic. 



35 
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TABLE .24 
Binding and Immunogenicity 
HBV Polymerase (ayw) 

CTL 



Peptide 












Binding** 


Activity 


Algorithm 


12 3 4 


5 


6 


7 


8 


9 








F L L S 


L 


G 


I 


H 


L 


0.52 


63 


-20.8 


G L Y S 


S 


T 


V 


P 


V 


0.15 


10 


-21.9 


H L y S 


H 


P 


I 


I 


L 


0.13 


10 


-21.1 


W I L R 


G 


T 


s 


F 


V 


0.018 


- + 


-20.9 


N L S W 


L 


S 


L 


D 


V 


0.013 


6 


-24.7 


L L S S 


N 


L 


S 


W 


L 


0.005 




-21.7 


N L Q S 


L 


T 


N 


L 


L 


0.003 




-23 .9 


H L L V 


G 


S 


S 


G 


L 


0.002 




-24.7 


L L D D 


E 


A 


G 


P 


L 


0.0002 




-25.5 


P L E E 


E 


L 


P 


R 


L 


0.0001 




-26.1 


D L N L 


G 


N 


L 


N 


V 


. * 




-25.7 


N L y V 


S 


L 


L 


L 


L 






-23 .6 


P L P I 


H 


T 


A 


E 


L 






-25.04 



*-=<0.0001 

** Relative binding capacity compared to std with IC5Q = 52mM 
XXX Lytic iinits/10^ cells; 1 lytic unit = the number of 
effector cells required to give 30% Cr^^ release, 
no measurable cytotoxic activity. 
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Example 11 
Idenrif ication of immunogenic peptides 
Using the motifs identified above for HLiA-A2.1 
allele amino acid sequences from a tumor- related protein, 
5 Melanoma Antigen- l (MAGE-l) , were analyzed for the presence of 
these motifs. Sequences for the target antigen are obtained 
from the GenBank data base (Release No. 71.0; 3/92) . The 
identification of motifs is done using the " FIND PATTERNS " 
program (Devereiix et al., Nucleic Acids Research 12:387-395 
10 (1984) ) . 

Other viral and tumor- related proteins can also be 
cuialyzed for the presence of these motifs. The amino acid 
sequence or the nucleotide sequence encoding products is 
obtained from the GenBank database in the cases of Human 

15 Papilloma Virus (HPV) , Prostate Specific antigen (PSA) , p53 

oncogene, Epstein Barr Nuclear Antigen-1 (EBNA-1) , and c-erb2 
oncogene (also called HER-2/neu) . 

In the cases of Hepatitis B Virus (HBV) , Hepatitis C 
Virus (HCV) , sind Human Iirmunodef iciency Virus (HIV) several 

20 strains/isolates exist and many sequences have been placed in 
GenBank . 

For HBV, binding motifs are identified for the adr, 
adw and ayw types. In order to avoid replication of identical 
sequences, all of the adr motifs and only those motifs from 

25 adw and ayw that are not present in adr are added to the list 
of peptides. 

In the case of HCV, a consensus sequence from 
residue 1 to residue 782 is derived from 9 viral isolates. 
Motifs are identified on those regions that have no or very 

30 little (one residue) variation between the 9 isolates. The 
sequences of residues 783 to 3 010 from 5 viral isolates were 
also analyzed. Motifs common to all the isolates are 
identified and added to the peptide list. 

Finally, a consensus sequence for HIV type 1 for 

35 North American viral isolates (10-12 viruses) was obtained 
from the Los Alamos National Laboratory database (May 1991 
release) and analyzed in order to identify motifs that are 
constant throughout most viral isolates. Motifs that bear a 
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small degree of variacion (one residue, in 2 forms) were also 
added to the peptide list. 

Appendices 1 and 2 provide the results of searches 
of the following antigens cERB2 , EBNAl, HBV, HCV, HIV, HPV, . 
5 MAGE, p53, and PSA. Only peptides with binding affinity of at 
least 1% as compared to the standard peptide in assays 
described in Example 5 are presented. Binding as compared to 
the standard peptide is shown in the far right column. The 
column la±>eled "Pos." indicates the position in the antigenic 
10 protein at which the sequence occurs . 

Example 12 
Identification of immunogenic peptides 
Using the motifs disclosed here, amino acid 
sequences from various antigens were screened for further 
15 motifs. Screening was carried out as described in Example 11. 
Tables 25 and 26 provide the results of searches of the 
following antigens cERB2 , CMV, Influenza A, HBV, HIV, HPV, 
MAGE, p53, PSA, Hu S3 ribosomal protein, LCMV, and PAP. Only 
peptides with binding affinity of at least 1% as compared to 
20 the standard peptide in assays described in Exainple 5 are 
presented. Binding as compared to the standard peptide is' 
shown for each peptide. 
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TABLE 25 



Sequence 


Antigen 


Molecule 


A2 
Bind. 


KIFGSLAFL 


C-ERB2 




0 . 1500 


RILHNGAYSL 


C-ERB2 




0 . 0180 


IISAWGILL 


C-ERB2 




0 . 0120 


MMWFWLTV 


CMV 




0.7600 


YLLLYFSPV 


CMV 




0.7500 


YLYRLNFCL 


CMV 




0 . 7200 


FMWm*VTL 


CMV 




0 . 6800 


LLWWITILL 


CMV 




0 .4900 


GLWCVLFFV 


CMV 




0 .4700 


LMIRGVLEV 


CMV 






LLLCRIiPFL 


CMV 




w • ^ A u w 


AMSRNLFRV 


CMV 




0 • 1500 


AMLTACVEV 


CMV 




0 . 1000 


RLQPNVPLV 


CMV 




0 . 0480 


VIARTFTPV 


CMV 






RLLRGLIRL 


CMV 




0 . 0370 


WMWFPSVLL 


CMV 




w . W J O U 


YLCCGITLL 


CMV 




0 . 0210 


SLLTEVETYV 


FLU-A 


Ml 


0 . 0650 


LLTEVETYV 


FLD-A 


Ml 


0 . 2000 


LLTEVETYVL 


FLU-A 


Ml 


0 . 0130 


GILGFVFTL 


FLU-A 


Ml 


0.1900 


GILGFVFTLT 


FLU-A 


Ml 


0.0150 


ILGFVFTLT 


FLU-A 


Ml 


0.2600 


ILGFVFTLTV 


FLU-A 


Ml 


0.0550 


ALASCMGLI 


FLU-A 


Ml 


0.0110 


RMGAVTTEV 


FLU-A 


Ml 


0.0200 
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Table 2 5 (Cont'd) 



Sequence 


Antigen 


Molecule 


A2 
Bind. 


VTTEVAFGL 


FLU-A 


Ml 


0.0360 


MVTTTNPLI 


FLU-A 


Ml 


0.0150 


FTFSPTYKA 


HBV 


POL 


0.0190 


YLHTLWKAGI 


HBV 


POL 


0.0280 


LMLQAGFFLV 


HBV (a) 


ENV(a) 


0.6300 


RMLTIPQSV 


HBV (a) 


ENV(a) 


0.0580 


SLDSWWTSV 


HBV (a) 


ENV(a) 


0.1000 


FMLLliCIilFL 


HBV(a) 


ENV(a) 


0.0450 


LLPFVQWFV 


HBV (a) 


ENV(a) 


0.6500 


LMPFVQWFV 


HBV (a) 


BNV(a) 


0.8300 


FLGIiSPTVWV 


HBV (a) 


ENV(a) 


0.0300 


SMLSPFLFLV 


HBV(a) 


ENV(a) 


0.9700 


GLWIRTPPV 


HBV(a) 


ENV(a) 


0.3600 


NLGNIiNVSV 


HBV(a) 


ENV(a) 


0.0160 


YLHTLWKAGV 


HBV (a) 


POL (a) 


0.1500 


RLTGGVFLV 


HBV(a) 


POL (a) 


0.1600 


RMTGGVFIiV 


HBV(a) 


POL (a) 


0.1500 


RLTGGVFLV 


HBV (a) 


ENV(a) 


0.1600 


ILGLI/GFAV 


HBV<a) 


ENV(a) 


0.0600 


GLCQVFADV 


HBV(a) 


ENV(a) 


0.0300 


WLLRGTSFV 


HBV (a) 


BNV{a) 


0.1000 


YLPSAIiNPV 


HBV (a) 


ENV(a} 


0.3200 


LLVPFVQWFA 


HBV adx 




0.2600 


FLPSDFFPSI 


HBV adr 




0.2100 


WSYVNVNM 


HBV adr 




0.0100 


HLPDRVHFA 


HBV adr 




0.0160 


SLAFSAVPA 


HBV adr 




0.0340 
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Table 25 (Cont'd) 



Sequence 


Antigen 


Molecule 


A2 
Bind. 


FLLTKILTI 


HBV adw 




0.6300 


SLYIIILSPFM 


HBV adw 




0.0440 


CLFHIVNLI 


HBV adw 




0.2100 


RLPDRVHFA 


HBV adw 




0.0940 


ALPPASPSA 


HBV adw 




0.0710 


GLLGWSPQA 


HBV ayw 




0.8650 


FLGPLLVLQA 


HEV ayw 




0.0190 


FLLTRILTI 


H5V ayw 




0.9300 


GMLPVCPLI 


HBV ayw 




0.0520 


QLFHLCLII 


HBV ayw 




0.0390 


KLCLGWLWGM 


HBV ayw 




0.0210 


LLWFHISCLI 


HBV ayw 




0.0130 


YLVSFGVWI 


HBV ayw 




2.7000 


LLSDWGPCA 


HBV ayw 




0.0180 


KLHLYSHPI 


HBV ayw 




0.2900 


FLLAQFTSA 


HBV ayw 




0.6600 


LLAQFTSAI 


HBV ayw 




9.6000 


YMDDWLGA 


HBV ayw 




0.1600 


AUIPLYACI 


HBV ayw 




0.2000 


GLCQVFADA 


HBV ayw 




0.0X80 


HLPDLVHFA 


HBV ayw 




0.1100 


RLCCQLDPA 


HBV ayw 




0.0290 


AUIPLYACI 


HBV ayw 
polymerase 




0.5000 


FLCKQYLNL 


HBV ayw 

polymerase 

665-673 




0.0210 
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Table 25 (Conr'd) 



Sequence 


Antigen 


Molecule 


A2 
Bind. 


SLYADSPSV 


HBV 

polymerase 




0.3500 


ALMFLYASI 


HBV 

polymerase 




0.0760 


NLNNLNVSI 


HBV 

polymerase 




0.0660 


ALSLIVNliL 


HBV 

polymerase 




0.0470 


KLHLYSHPI 


HBV 

polymerase 




0.2900 


WILRGTSFV 


HBV 

polymerase 
1344-1352 




0.0270 


ZiVLQAGFFLL 


HBVacar 


BNV 


0.0150 


FIT.T.T.CXIPX 


HBVadr 


ENV 


0.0280 


WILRGTSFV 


KBVadr 


POL 


0.0180 


IISCTCPTV 


HBVadw 


PreCore 


0.0190 


LVPFVQWFV 


HBVadw 


ENV 


0.0200 


LIISCSCPTV 


HBVadw 


CORE 


0.0290 


FLPSDFFPSI 


HBVayr 


PreCore 


0.2100 


LLCLGWLWGM 


HBVayr 


PreCore 


0.0220 


QLFHLCIill 


HBVayw 


PreCore 


0.0390 


CLGWLTGMDI 


HBVayw 


PreCore 


0.0190 


FLGGTTVCIj 


HBVayw 


ENV 


0.1700 


SLYSILSPFL 


HBVayw 


ENV 


0.2000 


FLPSDFFPSV 


HBVayw 


CORE 


1.5000 


ILCWGELMTL 


HBVayw 


CORE 


0.1900 


IlinATWVGV 


HBVayw 


CORE 


0.6800 


TIATWVGVNL 


HBVayw 


CORE 


0-5700 
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Table 25 (Cont'd) 



ion 


>UiU X ^ CXI 


L X\J X C U X C 


A2 




rxo vayw 


POT. 


w • x^ u u 






POT. 


U . X / u u 


RMRGTPe JVP 
ru^xvw A f IT fi r i 1 






U . U X X V/ 


SLYADSPSV 


fio w ay w 


POL 


b "^^on 

U . J 3 U W 


YLYGVRSAV 


HCV 




U • X D UU 


LLSTTEWQV 


HCV 




0.0480 


IIGAETFYV 


HIV 


FOL 


0.0260 


QLWVTVYYGV 


HIV 


ENV 


0.0250 


NLWVTVYYGV 


HIV 


ENV 


0.0160 


KLWVTVYYGrV 


HIV 


ENV 


0.0150 


KLWVTVYYGV 


HIV.MN 
9PXbU 




0.0150 




XlAr VXD 


& / 


X . 4UUU 


•FT /? 1 Wf 'DT 


xiir vx b 


a / 


A ccnn 

U . O3U0 


YIiLDIjOEPV 

XX4X4WX/l^CU<r V 


nxr V X D t a/ 


•b / laj 


n '9^ HA 
U . ^^UU 


VMT .nx J^DITTT 
xgitil^XJ<|<"** V 


tram c / A \ 

HPvxb (a/ 


B7 (a) 


1 .9000 


vl^fa^xr^rB 1 X 


xllrVXbb / 


b/ 


0 . 0X30 


OXi^iJ XCi X X ^ V X ^.I^X V 


worn Q 


PC 


0 . 0100 


PT.T.TgT.PTTT. 


WCT7 

no V 




A ? il A A 

0 . 3400 




wcv 
xia V 




A "a A A A 

U • 3400 


T.T.T.WnVQT. 


CXO V 




A 1 DA A 
U . ^DOO 


DMLGRVFFV 


HSV 




0 .0110 


TOFEALPHI 


LCM7 


Gp 


0 .2300 


alisfll:a 


LCMV 


Gp 


0.-1:00 


TLMSIVSSL 


LCMV 


Gp 


0.2000 


NISGYNFSL 


LCMV 


Np 


0.0280 


ALLDGGNML 


LCMV 


Np 


0.0320 
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Table 25 (Cont'd) 



Sequence 


Anrigen 


Molecule 


A2 
Bind. 


ALHLFKTTV 


LCMV 


Gp 


0.0170 


SXilSDQLLM 


LCMV 


Gp 


0 .0540 


WLVTNGSYL 


LCMV 


Gp 


0.0180 




LCMV 


Gp 


0.4300 




LCMV 


Gp 


0.0460 


LMFSTSAYL 


LCMV 


Gp 


0.3600 


YLVSIFLHL 


LCMV 


Gp 


0.4200 


SLHCKPEEA 


MAGEl 




0.0130 


ALGLVCVQA 


MAGEl 




0.0150 


LVIiGTTiEEV 


MAGEl 




0 .0320 


GTIuEEVPTA 


MAGEl 




0.0130 


CILESLFRA 


KAGEl 




0.0460 


KVADLVGFIiL 


MAGEl 




0.0560 


KVMLVGFLLL 


MAGEl 




0.0200 


VMIAKEGGHA 


MAGEl 




0.0360 


SMHCKFEEV 


MAGEl (a) 




0.0180 


AMGLVCVQV 


MAGEl (a) 




0.0120 


LMLGTLEEV 


MAGEl (a) 




0.1300 


KHADIiVGFLV 


MAGEl (a) 




1.5000 


VMVTCliGLSV 


MAGEl (a) 




0.3000 


LLGDNQIMV 


MAGEl (a) 




0.0430 


QMMPKTGFLV 


MAGEl (a) 




0.0500 


VMTAMEGGHV 


MAGEl (a) 




0.0530 


WMEI*SVMEV 


MAGEl (a) 




0.0410 


FLWGPRALA 


MAGEIN 




0.0420 


RAIAETSYV 


MAGEIN 




0.0100 


AEAETSYVKVIi 


MAGEIN 




0.0120 
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Table 25 (Cont'd) 



Sequence 


Antigen 


Molecule 


A2 
Bind . 


ALAETSYVKV 


MAGEIN 




0 . 0150 


KVLEYVTKV 


MAGEIN 




0 . 0900 


YVTKVSARV 


MAGE IN 




0 . 0140 


AIiREEEZ' " 


MAGEIN 




0 . 0210 


YMFLWGPRV 


MAGEIN (a) 




0 . 2200 


KMVELVHFLLL 


MAGE2 




0 . 6700 


KMVELVHFL 


MAGE2 




0 . 1600 


KMVELVHFIiL 


MAGE2 




0 . 1100 


KASEYLQLV 


MAGE2 




0 . 0110 


YLQLVFGIEV 


MAGS2 




0 .3700 


LVFGIEWEV 


MAGE2 




0 . 0120 


QLVFGIEIMEV 


MAGE3 




0 • 3400 


KVAELVHFL 


MAGE3 






KVTAELVHFUi 


MAGE3 






ELMEVDPIGHL 


MAGE3 




n 0260 


HLYIFATCLGL 


MAGE3 




0 • 0410 


IMPKAGI4LIIV 


MAGE3 




0 . 0130 


LVFGIELMEV 


MAGE3 




0 . 1100 


ALGRNSFEV 


p53 264-272 A8 (Al) 


0 - 0570 


LLGANSFEV 


p53 264-272 A8 (A4) 


0 . 1100 


LLGRASFEV 


p53 264-272 A8 (A5) 


0 • 2200 


LLGRNAFEV 


p53 264-272 AS (A6) 


0.0390 


LLGRNSFAV 


p53 264-272 A8 (A8) 


0.0420 


RLGRNSFEV 


p53 264-272 A8 (Rl) 


0.0190 


LLGRRSFEV 


p53 264-272 A8 (R5) 


0.0540 


LLGRKSFRV 


p53 264-272 A8 (R8) 


0.0250 


LLFFWLDRSV 


PAP 




0.6000 
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Table 25 (Conc'di 



Sequence 


Antigen 


Molecule 


A2 
Bind. 


VIAKEUCFV 


PAP 




0.0590 


ILiLWQPIPV 


PAP 




1.3000 


IMySAHDTTV 


PAP 




0.0610 


FLTLSVTWI 


PSA 




0.0150 


FLTLSVTWIGA 


PSA 




0 .0160 


FLTLSVTWI 


PSA 




0.0150 


VLVHPQWVLTA 


PSA 




0.0130 


SLFHPEDTGQV 


PSA 




0.0190 


MLIiRLSEPAEL 


PSA 




0 .1400 


ALGTTCYA 


PSA 




0.0230 


KLQCVDIiHVT 


PSA 




0.0370 


FLPSDYFPSV 


KBVC18-27 analog 


1.0000 


YSFLPSDFFPSV 


HBVC18-27 analog 


0.0190 
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Table 26 



Sequence 


Antigen 


Molecule 


A2 

Bind. 


ALFLGFLGAA 


HIV 


gpl60 


0.4950 


MLQLTVWGI 


HIV 


gpl60 


0.2450 


RVTEVLQRA 


HIV 


gpl60 


0.1963 


KLTPLCVTL 


HIV 


gpl60 


0.1600 


LLIAARIVEL 


HIV 


gpl60 


0.1550 


SLLNATDIAV 


HIV 


gpl60 


0.1050 


ALFLGFLGA 


HIV 


gpl60 


0.0945 


HMLQLTVWGI 


HIV 


gpl60 


0.0677 


LLNATDIAV 


HIV 


gpl60 


0.0607 


ALLYKLDIV 


HIV 


gpl60 


0.0362 


WLWYIKIFI 


HIV 


gpl60 


0.0355 


TIIVHLNESV 


HIV 


gpl60 


0.0350 


LLQYWSQEL 


HIV 


gpieo 


0.0265 


imVGGLVGL 


HIV 


gpl60 


0.0252 


LLYKLDIVSI 


HIV 


gpl60 


0.0245 


FLAIIWVDL 


HIV 


gpl60 


0.0233 
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Table 2 6 (Conc'd) 



TliQCKIKQII 


HIV 


gpl60 


0.0200 ' 


GLVGLRIVFA 


HIV 


gpl60 


0.0195 


FliGAAGSTM 


HIV 


gpl60 


0.0190 


IISLWDQSL 


HIV 


gpl60 


0,0179 


TVWGIKQLQA 


HIV 


gpl60 


0 .0150 


LliGRRGWEV 


HIV 


gpl60 


0 .0142 


AVLSIVNRV 


HIV 


gpl60 


0.0132 


FIMTVGGLV 


HIV 


gpl60 


0.0131 


If} >rj AX 1 1 J xj^yj^ 


XIX. V 




0 . 0117 


FIiYGALLLA 


PliP 




1.9000 


SLLTFMIAA 


PLP 




0.5300 


FMIAATYNFAV 


PliP 




0 .4950 


RMY-GVIiPWI 


PLP 




0.1650 


lAATYNFAV 


PliP 




0.0540 


GliLECCARCI^V 


PLP 




0.0515 


YALiTWWLL 


PliP 




0.0415 


ALTWWLLV 


PliP 




0.0390 


FLYGALLL 


PliP 




0.0345 


SLCADARMYGV 


PliP 




0.0140 


LLVFACSAV 


PLP 




0.0107 
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Table 2 6 (Cont'd) 



Sequence 


Antigen 


A2 


KMVELVHFLL 


MAGE2 


0.2200 


KV3;ELVHFL 


KAGE3 


0.0550 


RALAETSYV 


MAGEIN 


0.0100 


LVFGIELMEV 


KAGE3 


0.1100 


FLWGPRALA 


MAGEIN 


0.0420 


ALAETSYVKV 


MAGEl 


0.0150 


LVLGTLEEV 


HIV 


0.0320 


LLWKGEGAW 


HIV 


0.0360 


IIGAETFYV 


HIV 


0.0260 


LMVTVYYGV 


HIV 


0.4400 


LLFNILGGWV 


HCV 


3.5000 


LLALLSCLTV 


HCV 


0.6100 


YLVAYQATV 


HCV 


0.2500 


FLLIADARV 


HCV 


0.2300 


ILA6YGAGV 


HCV 


0.2200 


YLLPRRGPRL 


HCV 


0.0730 


GLIiGCIITSL 


HCV 


0.0610 


DLMGYIPLV 


HCV 


0.0550 


LLALLSCLTX 


HCV 


0.0340 


VLAALAAYCL 


HCV 


0.0110 


LLVPFVQWFV 


HBV 


1.6000 


FLLAQFTSA 


HBV 


0.6600 


FLLSLGIKL 


HBV 


0.5200 


ALMPLYACI 


HBV 


0.5000 


ILLLCLIFLL 


HBV 


0.3000 


LLPIFFCLWV 


HBV 


0.1000 


YLHTLWKAGI 


HBV 


0.0560 
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Table 2 6 (Cont ' d) 



YLHTLWKAGV 



HBV 



0 . 1300 
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Example 13 
Identification of immunogenic peptides 
in autoantiaens 
As noted above, the motifs of the present invention 
5 can also be screened in antigens associated with autoimmune 
diseases. Using the motifs identified above for HLA-A2.1 
allele amino acid sequences from myelin proteolipid (PLP) , 
myelin basic protein (MBP) , glutamic acid decarboxylase (GAD) , 
and human collagen types II and IV were analyzed for the 
10 presence of these motifs. Sequences for the antigens were 
obtained from Trifilieff et al., C.R. Sceances Acad. 5ci. 
300:241 (1985); Eyler at al., J. Biol. Chem, 246:5770 (1971); 
Yamashita et al. Biochiem. Biophys. Res. Comm. 192:1347 
(1993); Su et al.. Nucleic Acids Res. 17:9473 (1989) and — 
15 Pihlajaniemi et al. Proc. Natl. Acad. Sci. USA 84:940 (1987). 
The identification of motifs was done using the approach 
described in Example 5 and the algorithms of Examples 6 and 7. 
Table 27 provides the results of the search of these antigens. 
Using the quantitative binding assays of Example 4, 
20 the peptides are next tested for the ability to bind MHC 
molecules. The ability of the peptides to suppress 
proliferative responses in autoreactive T cells is carried out 
using standard assays for T cell proliferation. For instance, 
methods as described by Miller et al. Proc. Natl. Acad. Sci. 
25 USA, 89:421 (1992) are suitable. 

For further study, animal models of autoimmune 
disease can be used to demonstrate the efficacy of peptides of 
the invention. For instance, in HLA transgenic mice, 
autoimmune model diseases can be induced by injection of MBP, 
30 PLP or spinal cord homogenate (for MS) , collagen (for 

arthritis). In. addition, some mice become spontaneously 
affected by autoimmune disease (e.gr. , NOD mice in diabetes) . 
Peptides of the invention are injected into.. the appropriate 
animals, to identify preferred peptides. 



35 
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TABLE 27 
Human PLP peptides 

Pos AA ?1 P2 P3 P4 P5 P6 P7 P8 P9 PIO Allele Motif 



3 


9 


Li 


L 


E 


C 


C 


A 


R 


C 


L 




23 


9 


G 


L 


C 


F 


F 


G 


V 


A 


L 




39 


9 


A 


L 


T 


G 


T 


E 


K 


L 


I 




134 


9 


S 


L 


E 


R 


V 


C 


H 


C 


L 




145 


9 


w 


L 


G 


H 


P 


D 


K 


F 


V 




158 


9 


A 


L 


T 


V 


V 


W 


L 


L 


V 




164 


9 


L. 


L 


V 


F 


A 


C 


S 


A 


V 




205 


9 


R 


M 


Y 


G 


V 


L. 


P 


W 


I 




2 


10 


G 


L 


L 


E 


c 


C 


A 


R 


c 


L 


3 


10 


L 


L 


E 


C 


c 


A 


R 


C 


L 


V 


10 


10 


C 


L 


V 


G 


A 


P 


F 


A 


S 


L 


163 


10 


w 


L 


L 


V 


F 


A 


C 


S 


A 


V 


250 


10 




L 


V 


s 


L 


L 


T 


F 


M 


I 


64 


9 


V 


I 


H 


A 


F 


Q 


Y 


V 


I 




80 


9 


F 


L 


Y 


G 


A 


L 


L 


L 


A 




157 


9 


Y 


A 


L 


T 


V 


V 


W 


L 


L 




163 


9 


W 


L 


L 


V 


F 


A 


c 


S 


A 




234 


9 


Q 


M 


T 


F 


H 




F 


I 


A 




251 


9 




V 


S 


L 


L 


T 


F 


M 


I 




253 


9 


s 


L 


L 


T 


F 


M 


I 


A 


A 




259 


9 


I 


A 


A 


T 


Y 


N 


F 


A 


V 




84 


10 


A 




L 


L 


A 


E 


G 


F 


Y 


T 


157 


10 


Y 


A 


L 


T 


V 


V 


W 


L 


L 


V 


165 


10 




V 


F 


A 


c 


s 


A 


V 


P 


V 


218 


10 


K 


V 


C 


G 


s 


N 


L 


L 


S 


I 


253 


10 


S 


L 


L 


T 


F 


M 


I 


A 


A 


T 



A2.1 (LM)2; (LVDc 



Algorithm 



Table 27 continued 
Human Collagen TypelV peptides 



Pos 


AA 


PI 


P2 


P3 


P4 


P5 


P6 


P7 


P8 


P9 


PIO Allele 


Motif 


5 


9 


A 


L 


M 


G 


P 


L 


G 


L 


L 


A2.1 


(LM)2; (LVT) 


11 


9 


G 


L 


L 


G 


Q 


X 


G 


P 


L 






23 


9 


G 


M 


L 


G 


Q 


K 


G 


E 


I 






231 


9 


P 


L 


G 


Q 


D 


G 


L 


P 


V 






3 


10 


T 


Xi 


A 


L 


M 


G 


P 


L 


G 


L 




24 


10 


M 


L 


G 


Q 


K 


G 


E 


I 


G 


L 




59 


10 


P 


L 


G 


K 


D 


G 


P 


P 


G 


V 




139 


10 


P 


L 


G 


L 


P 


G 


A 


S 


G 


L 





Human Collagen Typell peptides 



Pos AA PI P2 P3 P4 P5 P6 P7 P8 P9 PIO Allele Motif 

794 9GLAGQRG.IV A2.1 (l-M) 2 ; (LVDc 

17 9VMQGPMGPM Algorithm 
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Table 27 continued 
Human GAD peptides 



Pos 


AA 


PI 




P3 


P4 


P5 


P6 


P7 


P8 


P9 


PIO j-wllele Motir 


56 


9 


s 




& 


E 


K 


S 


R 


L 


V 


A2.1 (LM)2; (LVT) c 


116 


9 


F 


T 

Li 


T 


£ 


V 


V 


D 


I 


L 




117 


9 


T 

Li 


T 


h> 


V 


V 


D 


I 


L 


L 




150 


9 


G 


w 

rl 




G 


F 


N 


L 


E 


L 




157 


9 


£ 


T 

Li 


o 


D 


H 


P 


E 


S 


L 




168 


9 


X 


T 

Li 


V 


D 


C 


R 


D 


T 


L 




190 


9 


Q 


T 


b 


T 


G 


L 


D 


I 


I 




223 


A 

9 


T 


T 

Ij 


K 


K 


M 


R 


E 


I 


V 




275 


9 


G 


11 


A 


A 


V 


P 


K 


L 


V 




300 


A 

9 


A 


T 

U 


G 


F 


G 


T 


D 


N 


V 


A2.1 (LM)2; (LVI) c 


409 


9 


V 


T 
JJ 


T 


f\ 
W 


C 


S 


A 


I 


L 


410 


A 

9 




T 
ll 


r\ 

Q 


C 


s 


A 


I 


L 


V 




416 


A 

9 


T 
i. 


T 
1* 


V 


K 


£ 


K 


G 


I 


L 




466 


A 

9 


L 


w 

£1 


W 


K 


A 


K 


G 


T 


V 




C O X 

3 J4 


A 

9 




T 


n 


K 


V 


A 


P 


K 


I 




546 


A 

9 


n 


v# 

H 


c« 

A 


S 


G 


T 


T 


M 


V 






q 


p 


L 


T 


IT 


IT 


T 
X 


£ 


K 


T 
ij 




42 


10 


K 


L 


G 




K 


I 


C 


G 


F 


L 


116 


10 


F 


T 


L 


E 


V 


V 


D 


I 


L 


L 


138 


10 


V 


L 


D 


F 


H 


H 


P 


H 


Q 


L 


147 


10 


L 


li 


E 


G 


M 


E 


G 


F 


N 


L 


212 


10 


N 


M 


F 


T 


Y 


T? 


T 


a 




V 


275 


10 


G 


M 


A 


A 


V 


P 


K 


L 


V 


L 


300 


10 


A 


L 


G 


F 


G 


T 


D 


N 


V 


I 


328 


10 


I 


L 


E 


A 


K 


Q 


K 


G 


Y 


V 


381 


10 


L 


M 


S 


R 


K 


H 


R 


H 


K 


L 


409 


10 


V 


L 


L 


Q 


C 


s 


A 


I 


L 


V 


435 


10 


L 


L 


Q 


P 


D 


K 


Q 


Y 


D 


V 


465 


10 


W 


L 


M 


w 


K 


A 


K 


G 


T 


V 


485 


10 


£ 


L 


A 


E 


Y 


L 


Y 


A 


K 


I 


545 


10 


L 


M 


M 


E 


S 


G 


T 


T 


M 


V 


252 


9 


G 


A 


I 


S 


N 


M 


Y 


S 


I 


Algorithm 


367 


9 


N 


L 


W 


L 


H 


V 


D 


A 


A 


567 


9 


R 


H 


V 


I 


S 


N 


P 


A 


A 




299 


10 


A 


A 


L 


G 


F 


G 


T 


D 


N 


V 


406 


10 


M 


H 


G 


V 


L 


L 


Q 


C 


S 


A 


423 


10 


I 


L 


Q 


G 


C 


N 


Q 


H 


c 


A 
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Example 14 

Imrnnnoaenicity of HPV peptides in A2 . 1 transgenic mice 
A group of 14 HPV peptides, including 9 potential 
epitopes plus 3 low binding and one non-binding peptides as 
5 controls was screened for iinmunogenicity in HIiA-A2.1 

transgenic mice using the methods described in Example 10. To 
test the immunogenic potential of the peptides, HLA A2 . 1 
transgenic mice were injected with 50 /xg/mouse of each HPV 
peptide together with 140 ^g/mouse of helper peptide (HBV core 

10 128-140 (TPPAYRPPNAPIL) . The peptides were injected in the 

base of the tail in a 1:1 emulsion IFA. Three mice per group 
were used. As a positive control, the HBV polymerase 561-570 
peptide, which induced a . strong CTL response in previous 
experiments, was utilized. 

15 Based on these results (Taile 28), four unrelated 

peptides were considered to be the most immunogenic: TLGIVCPI 
, LLMGTLGIV, YMLDLQPETT, and TIHDIILECV. TLGIVCPI and 
YMLDLQPETT were found to be good HIiA-A2,l binders, while 
LLMGTLGIV and TIHDIILECV were found to be intermediate binders 

20 in previous binding assays. 
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TABIiE 28 

HPV-16 Pepcides for possible use in clinical trial 



5 


Peptide 

Ort c 4 t" i on / 

Cytel ID 


Sequence 


AA 


A2.1 
hindina 


Immunogeni c i ty 
Exp e r iinent 1 


Immunogeni ci ty 




E7.8S/ia8a.01 


TLGIVCPI 


8 


0.15 


94.4 (1.34) 


54.2 (1.43)* 




E7. 86/1088. 06 


TLGIVCPIC 


9 


0.075 


2.05 (4.93) 


1.3 (3.74) 


10 


E7. 85/1088. 08 


GTLGIVCPI 


9 


0.021 


9/08 (3.93) 






E7.il/1088.03 


YMLDLQPETT 


10 


0.15 


10.32 (1,66) 


5.7 (2.39) 




E7.il/1088.04 


YMLDLQPET 


9 


0.14 


5.0 (3.70) 


2.6 (15,5) 




E7. 12/1088. 09 


MLDLQPETT 


9 


0.0028 


- 


- 




E6. 52/1088. 05 


FAFRDLCIV 


9 


0.057 


- 


ND 


15 


E7. 82/1088. 02 


LLMGTLGIV 


9 


0.024 


9.62 (2.53) 


8.93 (1.91) 




E6. 29/1088. 10 


TIHDIILECV 


10 


0.021 


22.13 (3.71) 


0.4 (3.52) 




E7. 7/1088 .07 


TLHEYMLDL 


9 


0.0070 




1.2 (3.88 




E6. 18/1088. 15 


KLPQLCTEL 


9 


0.0009 




0.3 (5.64) 




E6. 7/1088. 11 


AMFQDPQER 


10 


0.0002 




ND 


20 


E6. 26/1088. 12 


LQTTIHDII 


9 


0.0002 








E7. 73/1088. 13 


HVDIRTLED 


9 


0 




ND 



* A Lytic Units, geometric mean x-f SD (3 mice/peptide) 

a dash indicates a Lytic Units with a geometric mean ^0.2 

25 
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Mixcurgg cf selected HPV epitopes 

A combination of CTL peptides and a helper peptide 
were tested for the ability to provide an increased immxine 
response. The four single peptides were injected separately 
5 in order to compare their immunogenicity to injections 
containing only the two good binders or only the two 
intermediate binders. In addition all four peptide were 
injected together. To further evaluate the immunogenicity of 
a combination of peptides with different binding affinity 

10 decreases, another control was introduced in this experiment. 
A mixture of the two good binders was injected in a different 
site than the mixture of the two intermediate binders into the 
base of the tail of the same mouse. All groups of CTL 
epitopes were injected together with the HBVc helper epitope, 

15 with the exception of two groups in which all four HPV 

coinjected with two different doses of a PADRE helper peptide 
(aKXVAAWTLKAAa, where a is d- alanine and X is 
cyclohexyl alanine) either 1/xg or 0 . 05/ig per mouse. 

All four peptides induced a strong CTL response when 

20 injected alone and tested using target cells labeled with the 
appropriate peptide (Table 29) . TLGIVCPI proved to be the 
strongest epitope, an observation confirming the results 
described above. When mixtures of all four peptides were 
injected and the responses were stimulated in vitro and tested 

25 with target cells pulsed with each single peptide, all 

combinations showed a strong CTL response. No significant 
difference was observed when the two helper epitopes were 
compared. This might in part be due to the fact that the 
highest dose of PADRE used in this experiment was 140- fold 

30 lower than the one for the HBV helper peptide. 

Injection of mixtures of the two good binders 
together or the two intermediate binders resulted in a very 
low CTL response in both cases even though the single peptides 
were highly effective. These results, however, are due to a 

35 very low nximber of cell recovery after splenocyte culture of 6 
days and are therefore regarded as preliminary. 
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TJ^BLE 29 



HPV Peptides single and in combinations 

A 

5 









Peptides in 


restimulation 


and CTL assay 




Peptide /s injected 


1088.01 


1088 .02 


1088 .03 


1086 .10 




same as in vitro 


116.1 (3.49)* 


55.98 (2.49) 


5.56 (1.75) 


16.4 (1.491 


10 


1088.01 + 
1088 . 0^ *^ 
875 .23 


± . J / \ iO .30/ 




0 (0) 






1088.02 + 
1088.10 + 
875 .23 




1.11 (2.9) 




1.62 (13.1) 


15 


1088. 01/. 03 + 
;.088.02/.10 + 
375.23 


19.5 (4.1) 


4.68 (2.3) 


1.13 (21.9) 


1.17 (2.58) 




1088. all 


107.9 (4.77) 


13.52 (1.4) 


2.58 (5.07) 


102.3 (1.32) 


20 


+ 

875 .23 












1088 .all 


73.11 (4.48) 


16.83 (2.54) 


3.55 (2.9) 


20.13 (1.05) 




PADRE 1 ug 










25 


1088. all 
PADRE 0.05 ^g 


37.15 (2.25) 


26.79(2.09) 


6.5 (1.64) 


4.45 (4.14) 



* A Lytic Units 30% geometric mean (+x deviation) 



30 Peptides were dissolved in 50%DMSO/H20 to reach a stock concentration of 
20mg/ml and were further dissolved in sterile PBS. For subcutaneous 
injection in the base of the tail of A2.1 transgenic mice, the peptide 
solution was mixed 1:1 with IFA. The injected amount of HPV-CTL peptides 
was 50 ^ig/mouse coinjected with 140 ^g/mouse of the HBVcore peptide 875.23 

35 or the indicated dose of PADRE (3 mice/group) . Spleens were removed on 
day 11 and splenocytes were restimulated in vitro with irradiated LPS- 
Blasts pulsed with the indicated HPV-CTL epitopes at l^g/ml. After six 
days, the cytotoxic assay was performed using Jurkat JA2Kb cells (A) or 
MBB17 (B) as target cells labelled with 51Cr in the presence or absence of 

40 the appropriate HPV epitope peptides. 
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The above examples are provided to illustrate tiie 
invention but not to limit its scope. Other variants of the 
invention will be readily apparent to one of ordinary skill 
the art and are encompassed by the appended claims. All 
pxiblications , patents, and patent applications cited herein 
are hereby incorporated by reference. 
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APPENDIX I: 9 -MER PEPTIDES 







HA ■ 




Strain 


* .Molecule :- 


; ;;;Pos ; 




1.0841 


ILSPFLPLL 


9 


HBV 


adr 


ENV 


371 


2.9 


1.0240 


TLQDIVLHL 


9 


HPV 


IS 


E7 


7 


0.76 


1.0838 


WLSLLVPFV 


9 


HBV 


adr 


ENV 


335 


0 .72 


1.0851 


FLLSLGIHL 


9 


HBV 


adr 


POL 


1147 


0 .52 


1.0306 


QLFEDNYAL 


9 


C-ERB2 






106 


0.46 


1.0814 


LMVTVYYGV 


9 


HIV 




. ENV 


2182 


0.44 


1.0878 


MMWFWGPSL 


9 


HBV 


adw 


ENV 


360 


0 .41 


1.0839 


HMtfYWGPSL 


9 


HBV 


adr 


ENV 


360 


0.41 


1.0384 


FLTKQYLNL 


9 


HBV 


adw 


POL 


1279 


0.29 


1.0321 


ILHNGAYSL 


9 


C-ERB2 






435 


0.21 


1.0834 


LLLCLIFLL 


9 


HBV 


adr 


ENV 


250 


0.19 


1.0167 


GLYSSTVPV 


9 


HBV 


adr 


POL 


635 


0.15 


1.0849 


HLYSHPIIL 


9 


HBV 


adr 


POL 


1076 


0.13 


1.0275 


RHPEAAPPV 


9 


PS3 






65 


0.12 


1.0854 


LLMGTLGIV 


9 


HPV 


16 


E7 


82 


0.11 


1.0880 


ILSPFMPLL 


9 


HBV 


adw 


ENV 


371 


0.11 


1.0127 


YLVAYQATV 


9 


HCV 




LORF 


1585 


0.11 


1.0151 


VLLDYQGML 


9 


HBV 


adr 


ENV 


259 


0.11 


1.0018 


VLAEAMSQV 


9 


HIV 




GAG 


367 


0.11 


1.0330 


RLLQETELV 


9 


C-ERB2 






689 


0.091 


1.0209 


SLYAVSPSV 


9 ' 


- HBV - 


adr 


POL ' 


1388 


0.078 


1.0816 


DLMGYIPLV 


9 


HCV 




CORE 


132 


0.055 


1.0835 


LLCLIFLLV 


9 


HBV 


adr 


ENV 


251 


0.049 


1.0852 


FLCQQYLHL 


9 


HBV 


adr 


POL 


1250 


0.048 
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APPENDIX I: 9 -MER PEPTIDES 









.;::;,:Virua::::Q 


|i;:;:S.tira;ih;-.. 


Moiecnile-.v 






1.0882 


NLYVSLMLL 


9 


HBV 


adw 


POL 


1088 


0 .046 


1.0837 


GMLPVCPLL 


9 


HBV 


adr 


SNV 


265 


0.046 


1 .0819 


ILPCSFTTL 


9 


HCV 




NS1/ENV2 


676 


0 . 045 


1.0109 


ALSTGLIHL 


9 


HCV 




NS1/ENV2 


686 


0.042 


1.0833 


ILLLCLIFL 


9 


HBV 


adr 


ENV 


249 


0.035 


1.0301 


HLYQGCQW 


9 


C-ERB2 






48 


0 .034 


1.0337 


CLTSTVQLV 


9 


C-ER£2 






789 


0 .034 


1.0842 


PLLPIFFCL 


9 


HBV 


adr 


ENV 


377 


0.031 


1.0861 


ALCRWGLLL 


9 


C-ERB2 






5 


0.031 


1.0309 


VLIQRNPQL 


9 


C-ERB2 






153 


0.029 


1.0828 


VLQAGFFLL 


9 


HBV 


adr 


BNV 


177 


0.024 


1.0844 


LLWFHISCL 


9 


HBV 


adr 


CORE 


490 


0.024 


1.0135 


lUlGYGAGV 


9 


HCV 




LORF 


1851 


0.024 


1.0870 


QLMPYGCLL 


9 


C-ERB2 






799 


0..023 


1.0075 


LLWKGEGAV 


9 


HIV 




POL 


1496 


0.023 


1.0873 


FliGGTPVCL 


9 


HBV 


adw 


ENV 


204 


0.021 


1.0323 


ALIHHNTHI* 


9 


C-ERB2 






466 


0.021 


1.0859 


VliVHPQWVL 


9 


PSA 






49 


0.020 


1.0267 


KLQCVDUIV 


9 


PSA 






166 


0.019 


1.0820 


VLPCSFTTL 


9 


HCV 




NS1/ENV2 


676 


0.017 


1.0111 


HLHQNIVDV 


9 


HCV 




NS1/ENV2 


693 


0.016 


1.0103 


SMVGNWAKV 


9 


HCV 




KNVl 


364 


0.016 


1.0283 


LLGRNSFEV 


9 


P53 






264 


0 .014 


1.0207 


GLYRPLLSIi 


9 


HBV 


adr 


POL 


1370 


0 .014 
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APPENDIX I: 9 -MER PEPTIDES 





;::SaqiiBiice vi-; 






V Strain:-..' 




■•■■Pos'.::. 


v;tiL2:i..- 


1.0389 


GLYRPLLRL 


9 


HBV 


adw 


POL 


1399 


0 .014 


1.0185 


NLSWLSLDV 


9 


HBV 


adr 


POL 


996 


0.013 


1.0113 


FLLLADARV 


9 


HCV 




NS1/ENV2 


725 


0.013 


1.0119 


YLVTRHADV 


9 


HCV 




LORF 


1131 


0.011 


1.0846 


CLTHIVNLL 


9 


HBV 


adr 


POL 


912 


0.010 


1.0156 , 


ELMNIiATWV 


9 


HBV 


adr 


CORE 


454 


0.010 


1.0236 


KLPDLCTEL 


9 


HPV 


18 


E6 


13 


0.010 


1.0056 


ALQDSGLEV 


9 


HIV 




POL 


1180 


0.0083 


1.0375 


LLSSDLSWL 


9 


HBV 


adw 


POL 


1021 


0.0081 


1.0094 


ALAHGVRVL 


9 


HCV 




CORE 


150 


0.0072 


1.0129 


TLHGPTPLL 


9 


HCV 




lorf' 


1617 


0.0070 


1.0041 


KLLRGTKAL 


9 


HIV 




POL 


976 


0.0069 


1.0131 


CMSADLBW 


9 


HCV 




LORF 


1648 


0.0067 


1.0872 


GLLGPLLVL 


9 


HBV 


adw 


KNV 


170 


0.0066 


1.0228 


TLHEYULDL 


9 


HPV 


16 


E7 


7 


0.0059 


1.0274 


KLLPENNVL 


9 


p53 






24 


0.0058 


1.0043 


ILKEPVHGV 


9 


HIV 




POL 


1004 


0.0055 


1.0206 


RLGLYRPLL 


9 


HBV 


adr 


POL 


1368 


0.0050 


1.0188 


GLPRYVARL 


9 


HBV 


adr 


POL 


1027 


0.0050 


1.0202 


KLIGTDNSV 


9 


HBV 


adr 


POL 


1317 


0.0050 


1.0818 


FLLALLSCL 


9 


HCV 




CORE 


177 


0.0046 


1.0184 


LLSSmiSWL 


9 


HBV 


adr 


POL 


992 


0.0046 


1.0102 


QLLRIPQAV 


9 


HCV 




KNVl 


337 


0.0039 


1.0114 


GLRDLAVAV 


9 


HCV 




LORF 


963 


0.0034 
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APPENDIX I: 9 -MER PEPTIDES 











. ; : S train ; ; ( > Mol e cul e : : :: 


Pos . 


/:a2.i. 


1.0005 


TLNAWVKVT 


- 9 


HIV 




GAG 


156 


0.0032 


1.0183 


NLQSLTNLL 


9 


HBV 


adr 


POL 


985 


0.0025 


1.0359 


QLGRKPTPL 


9 


HBV 


adw 


ENV 


89 


0.0025 


1.0150 


SLDSWWTSIi 


9 


HBV 


adr 


BNV 


194 


0.0023 


1.0362 


ILSKTGDPV 


9 


HBV 


adw 


ENV 


153 


0.0021 


1.0866 


ILLWVIiGV 


9 


C-ERB2 






661 


0.0020 


1.0214 


U-HKRTLGL 


9 


HBV 


adr 


"X" 


1510 


0.0019 


1.0216 


CLFKDWEEL 


9 


HBV 


adr 


"X" 


1533 


0. 0019 


1.0862 


GLGISWIiGL 


9 


C-ERB2 






447 


0.0018 


1.0187 


KLLVGSSGL 


9 


HBV 


adr 


POL 


1020 


0.0018 


1.0318 


TLEEITGYL 


9 


C-ERB2 






402 


0.0018 


1.0328 


PIiTSIISAV 


9 


C-ERB2 






650 


0.0015 


1.0822 


LLGCIITSI* 


9 


HCV 




liORF 


1039 


0.0015 


^ 1.0277 


ALNKMFCQL 


9 


pS3 






129 


0.0013 


1.0066 


HLEGKIILV 


9 


HIV 




POL 


1322 


0.0010 


1.0308 


QLRSLTEIL 


9 


C-ERB2 






141 


0.0008 


1.0115 


DIAVAVEPV 


9 


HCV 




LORF 


966 


0.0008 


1.0391 


VLHKRTI/GL 


9 


HBV 


adw 


itx" 


1539 


0.0007 


1.0876 


FLCITJ.LCL 


9 


HBV 


adw 


KNV 


246 


0.0007 


1.0148 


I4LDPRVRGL 


9 


HBV 


adr 


KNV 


120 


0.0006 


1.0221 


KLPQLCTEL 


9 


HPV 


16 


E6 


18 


0.0006 


1.0065 


HLEGKVII.V 


9 


HIV 




POL 


1322 


0.0006 


1.0017 


EMWTACQGV 


9 


HIV 




GAG 


,35.0 


.0.0006 


1.0055 


HIiALQDSGL 


9 


HIV 




POL 


1178 


0.0005 
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APPENDIX I: 9-MER PEPTIDES 





Sequence.-. 






Strain • 


h:*-Moleculei:::' 




l:::::A2;i. 


1.0866 


VLGWFGIL 


9 


C-ERB2 






666 


0.0005 


1.0004 


TLNAWVKW 


9 


HIV 


GAG 


156 


0.0005 


1.0381 


HLESLYAAV 


9 


HBV 




POL 


1165 


0,0005 


1.0128 


CLIRLKPTL 


9 


HCV 


liORF 


1610 


0.0004 


1.0255 


CLGLSYDGL 


9 


MAGE 


1/3 




174 


0.0004 


1.0212 


HLSLRGLPV 


9 


HBV 


adr 


"X" 


1470 


0.0004 


1.0247 


ILESLFRAV 


9 


MAGE 


1 




93 


0.00G4 


1.0092 


TLTCGFADL 


9 


HCV 




CORE 


125 


0.0003 


1.0108 


TLPALSTGL 


9 


HCV 




NS1/ENV2 


683 


0.0003 


1.0294 


ALAIPQCRL 


9 


SBHAl 






525 


0.0003 


1.0101 


DLCGSVFLV 


9 


HCV 




ENVl 


280 


0.0003 


1.0231 


RLCVQSTHV 


9 


HPV 


16 


E7 


66 


0.0003 


1.0162 


LLDDEAGPL 


9 


HBV 


adr 


POL 


587 


0.0002 


1.0829 


CLRRFIIFL 


9 


HBV 


adr 


ENV 


239 


0.0002 


1.0126 


GLPVCQDHL 


9 


HCV 




LORF 


1547 


0.0001 


1.0163 


PLEEELPRL 


9 


HBV 


adr 


POL 


594 


0.0001 


1.0130 


PLLYRLGAV 


9 


HCV 


c 


LORF 


1623 


0.0001 


1.0042 


ELAENREIL 


9 


HIV 




POL 


997 


0 


1.0054 


ELQAIHLAL 


9 


HIV 




POL 


1173 


0 


1.0089 


LIPRRGPRIi 


9 


HCV 




CORE 


36 


0 


1.0091 


NLGKVTDTL 


9 


HCV 




CORE 


118 


0 


1.0093 


PLGGAARAL 


9 


HCV 




CORE 


143 


0 


1.0154 


DLLDTASAL 


9 


HBV 


adr 


CORE 


419 


0 


1.0178 


QLKQSRIiGL 


9 


HBV 


adr 


POL 


791 


0 
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APPENDIX I; 9-MER PEPTIDES 



"!-:^ Peptide.^-' 


Sequence 




l^i-viruaiv^;'.; 


;S train 


.'/Moleciil-e: 


....P.os........ 


..■::"A2,:X . 


1.0179 


GLQPQQGSL 


9 


HBV 


adr 


POL 


798 


0 


1.0286 


PLDGEYFTL 


9 


p53 






322 


0 


1.0296 


VUCDAIKDL 


9 


EBKAl 






574 


0 


1.0310 


QLCYQDTIL 


9 


C-ERB2 






160 


0 


1.0007 


DLNTMLNTV 


9 


HIV 




GAG 


188 


0 


1.0037 


ELHPDKWTV 


9 


HIV 




POL 


928 


0 


1.0070 


ELKKIIGQV 


9 


HIV 




POL 


1412 


0 


1.0157 


EIiWSYVNV 


9 


HBV 


adr 


CORE 


473 


0 


1.0160 


CLTFGRETV 


9 


HBV 


adr 


CORE 


497 


0 


1.0164 


DLNIiGNLNV 


9 


HBV 


adr 


POL 


614 


0 


1.0867 


LliVWLGW 


9 


C-ERB2 






662 


0 


1.0159 




9 


HBV 


adr 


CORE 


482 


0 


1.0322 


SLRELGSGIi 


9 


C-ERB2 






457 


<0 .0002 


1.0350 


DLLEKGERI^ 


9 


C-ERB2 






933 


<0 .0002 


1.0352 


DLVDAEEYL 


9 


C-ERB2 






1016 


<0.0002 


1.0366 


PX^EELFHIi 


9 


HBV 


adw 


POL 


623 


<0 .0002 


1.0372 


DLQHGRLVL 


9 


HBV 


adw 


POL 


781 


<0.0002 


1.0390 


PLPGPLGAL 


9 


HBV 


adw 


"X" 


1476 


<0.0002 


1.0811 


LLTQIGCTL 


9 


HIV 




POL 


685 


<0 .0002 


1.0812 


PLVKLWYQL 


9 


HIV 




POL 


1116 


<0.0002 


1.0832 


FliFIT.TiLCL 


9 


HBV 


adr 


ENV 


246 


<0.0002 


1.0847 


NLYVST.T.TiT. 


9 


HBV 


adr 


POL 


1059 


<0.0002 


1.0316 


PLQPEQLQV 


9 


C-ERB2 






391 


<0.0002 


1.0342 


DLAARNVIiV 


9 


C-ERB2 






845 


<0.0002 
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APPENDIX I: 9-MER PEPTIDES 





■JSeqiiiBiice 






Strain 


; Molecule- 


- :. 




1.0343 


VLVKSPNHV 


9 


C-ERB2 






851 


<0 .0002 


1.0356 


TLSPGKNGV 


9 


C-ERB2 






1172 


<0 .0002 


1.0376 


DLSWLSLDV 


9 


HBV 


adv 


POL 


1025 


<0.0002 


1.0363 


NMENIASGL 


9 


HBV 


adw 


ENV 


163 


<0.0002 


1.0195 


TLPQEHIVL 


9 


HBV 


adr 


POL 


1179 


<0.0003 


1.0196 


KLKQCFRKL 


9 


HBV 


adr 


POL 


1186 


<0.0003 


1.0201 


PLPIHTAEL 


9 


HBV 


adr 


POL 


1296 


<0.0003 


1.0210 


QLDPARDVL 


9 


HBV 


adr 


"X" 


1426 


<0.0003 


1.0220 


VLGGCRHKL 


9 


HBV 


adr 


"X" 


1551 


<0.0003 


1.0229 


DLQPETTDL 


9 


HPV 


16 


E7 


14 


<0.0003 


1.0245 


ALEAQQEAL 


9 


MAGE 


1 




15 


<0.0003 


1.0266 


DLPTQEPAL 


9 


PSA 






136 


<0.0003 


1.0279 


HLIRVEGNL 


9 


P53 






193 


<0.0003 


1.0282 


TL£DSSGm< 


9 


p53 






256 


<0.0003 


1.0238 


ELRHYSDSV 


9 


HPV 


18 


E6 


77 


<0.0003 


1.0268 


DLHVTSNDV 


9 


PSA 






171 


<0.0003 


1.0836 


CLIFLLVLL 


9 


HBV 


adr 


ENV 


253 


<0.0006 
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APPENDIX II: 10-MER PEPTIDES 



.;:i::::::!:P^P'C i.<ift;-;: : : - 


. .-: : d e£4UeZi WC: : : : : 




: :;V, X tlXS; : 


Stirain j .MoiccuXe : : 




:-:-::>l^.».i... 


1 . 0890 


LiLFNIIjGGwV 


10 


HCV 


1 liORF 




3 . 5 


1 . 0930 


liLVP FVQWFV 


10 


HBV 


adw 


ENV 


33 8 


1 . 6 


1.0884 


LLALLSCLTV 


10 


HCV 




CORE 


178 


0 . 61 


1.0896 


ILLLiCliIFIiL 


10 


HBV 


adr 


ENV 


249 


0 .30 


i.osia 


GLSPTVWLSV 


10 


HBV 


adr 


ENV 


348 


0 .28 


1 . 0902 


SLYNIIiSPFL 


10 


HBV 


adr 


ENV 


367 


0 . 23 


1.0892 


liLVLQAGFFL 


10 


HBV 


adr 


ENV 


175 


0 . 21 


1 . 0686 


FLQTHIFAEV 


10 


BBNAl 






565 


0 . 17 


1.0628 


QLFLNTLSFV 


10 


HPV 


18 


B7 


88 


0 . 11 


1. 0904 


IiLPIFFCLWV 


10 


HBV 


adr 


BNV 


378 


0 . io 


1.0897 


LLLCLIFIiLV 


10 


HBV 


adr 


ENV 


250 


0 . 099 


1.0516 


T iT 1 r) YQGMLP V 


10 


HBV 


adr 


BNV 


260 


0 .085 


1.0901 


WMMWyWGPSIi 


10 


HBV 


adr 


BNV 


359 


0 . 084 


1.0533 


GLYS STVPVL 


10 


HBV 


adr 


POL 


635 


0 .'080 


1 . 0469 


YLLPRRGPRL 


10 


HCV 




CORE 


35 


0 . 073 


1 . 0888 


GXiLGCIITSL 


10 


HCV 




liORF 


1038 


0 . 061 


1 . 0907 


I liCwGE LMNIi 


10 


HBV 


adr 


CORE 


449 


0 . 052 


1 . 0927 


Ltlxj I CliTSTv 


lU 


C-ERB2 






785 


0 . 049 


1 . 0452 


LkLWKCi £ GA W 


lU 


HIV 




POL 


1496 


0.036 


1 . 0885 




10 


HCV 




CORE 


178 


0 . 034 


J. . Uo^U 




1 n 

±. u 


rtirV 


18 


B6 




U . U J ^ 


1.0502 


RLIVFPDIiGV 


10 


HCV 




LORF 


2578 


0.032 


1.0659 


FLTPKKLQCV 


10 


PSA 






161 


0.031 


1.0932 


WMMVTFWGPSL 


10 


HBV 


adw 


ENV 


359 


0.029 
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:.;.Seguence; • 




' Virua : 


Strain 




* .'vPOS. 




1.0772 


SUTrLGGTPV 


10 


HBV 


adw 


ENV 


201 


0.027 


1.0609 


SZ-::iEITCV 


10 


HPV 


18 


E6 


24 


0.025 


1.0526 


ILZrLPETTV 


10 


HBV 


adr 


CORE 


529 


0 .022 


1.0508 


RLHGLSAFSL 


10 


HCV 




LORF 


2885 


0.020 


1.0493 


ILGGWVAAQL 


10 


HCV 




liORF 


1811 


0.018 


1.0738 


VMAGVGSPYV 


10 


C-ERB2 






773 


0.018 


1.0460 


QLMVTVYYGV 


10 


HIV 




ENV 


2181 


0.017 


1.0573 


ILRGTSFVYV 


10 


HBV 


adr 


POL 


1345 


0.016 


1.0703 


SLTEILKGGV 


10 


C-ERB2 






144 


0.015 


1.0912 


LLGCAANWIL 


10 


HBV 


adr 


POL 


1337 


0.014 


1.0798 


iU^PPASPSAV 


10 


HBV 


adw 


"X" 


1483 


0.013 


1.0908 


QLLWFHXSCL 


10 


HBV 


adr 


CORE 


489 


0.013 


1.0677 


NLLGRNSFEV 


10 


p53 






263 


0.013 


1.0889 


VLAALAAYCL 


10 


HCV 




LORF 


1666 


0.011 


1.0528 


LLLDDEAGPL 


10 


HBV 


adr 


POL 


586 


0.011 


1.0500 


IMAKNEVFCV 


10 


HCV 




LORF 


2558 


0.0088 


1.0492 


VLVGGVLAAL 


10 


HCV 




LORF 


1661 


0.0084 


1.0898 


LLCLIFLLVL 


10 


HBV 


adr 


ENV 


251 


0.0075 


1.0458 


KLMVTVYYGV 


10 


HIV 




ENV 


2181 


t).0069 


1.0459 


NLMVTVYYGV 


10 


HIV 




ENV 


2181 


0.0067 


1.0530 


GLSPTVWLSA 


10 


HBV 


adw 


ENV 


348 


0.0067 


1.0759 


SLPTHDPSPL 


10 


C-ERB2 






1100 


0.0059 


1.0419 


VLPEKDSWTV 


10 


HIV 




POL 


940 


0.0056 


1.0666 


FLHSGTAKSV 


10 


p53 






113 


0.0050 
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SequMcex::;"; 




; Virus - 


-Strain • 




•;Poa. 


:. Ai*a.. . 


1.0473 


GLIHLHQNIV 


10 


HCV 




NS1/ENV2 


690 


0.0047 


1.0792 


SLYAAVTNFL 


10 


HBV 


adw 


POL 


1168 


0.0046 


1.0780 


IMPARFYPNV 


10 


HBV 


adw 


POL 


713 


0 . 0043 


1.0507 


YLTRDPTTPL 


10 


HCV 




LORF 


2803 


0 .0042 


1.0914 


GLYNLLIRCL 


10 


HPV 


18 


E6 


97 


0 .0036 


1.0649 


YLEYGRCRTV 


10 


MAGE 


1 




246 


0 .0034 


1.0561 


SLFTSITNFL 


10 


HBV 


adr 


POL 


1139 


0.0034 


1.0788 


NLLSSDLSWL 


10 


HBV 


adw 


POL 


1020 


0.0032 


1.0753 


RMARDPQRFV 


10 


C-ERB2 






978 


0.0020 


1.0568 


RMRGTFWPt, 


10 


HBV 


adr 


POL 


1288 


0.0020 


1.0642 


SLQLVFGIDV 


10 


MAGE 


1 




150 


0.0020 


1.0582 


KLLHKRTLGIi 


10 


HBV 


adr 




1509 


0.0019 


1.0713 


GLGMEHLREV 


10 


C-ERB2 






344 


0.0017 


1.0742 


GMSYI-EDVRL 


10 


C-ERB2 






832 


0.0017 


1 .0549 


NLLSSNIiSWIi 


10 


HBV 


adr 


POL 


991 


0.0016 


1.0465 


QLTVWGIKQL 


10 


HIV 




KNV 


2760 


0.0015 


1,0524 


VLEYLVSFGV 


10 


HBV 


adr 


CORE 


505 


0.0015 


1.0483 


VliNPSVAATL 


10 


HCV 




LORF 


1253 


0.0015 


1.0548 


SLTNUoSSNIi 


10 


HBV 


adr 


POL 


988 


0.0014 


1.0512 


ALLDPRVRGL 


10 


HBV 


adr 


ENV 


119 


0.0011 


1.0676 


TLEDSSGNLI. 


10 


P53 






256 


0.0011 


1.0719 


TLQGLGISWL 


10 


C-ERB2 






444 


0.0011 


1.0627 


DLRAFQQLFL 


10 


HPV 


18 


E7 


82 


0.0010 


1.0725 


VLOGLPREYV 


10 


C-ERB2 






546 


0.0009 
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■:/:MoX4cilie::v 




.^;.A2-.:l.:. 


1.0918 


DLPPWFPPMV 


10 


EBNAl 






605 


0.0009 


1.0499 


DLSDGSWSTV 


10 


HCV 




liORF 


2399 


0.0008 


1.0559 


CLAFSYMDDV 


10 


HBV 


adr 


POL 


1118 


0.0008 


1.0632 


PLVLGTLEEV 


10 


MAGE 


1 




37 


0.0008 


1.0520 


NLATWVGSNL 


10 


HBV 


adr 


CORE 


457 


0.0008 


1.0400 


NLLTQIGCTL 


10 


HIV 




POL 


664 


0.0007 


1.0488 


GLTHIDAHFL 


10 


HCV 




IiORF 


1564 


0.0007 


1.0733 


VLGSGAFGTV 


10 


C-ERB2 






725 


0.0007 


1.0434 


QLIKKEKVYL 


10 


HIV 




POL 


1219 ( 


0.0006 


1.0451 


KLLWKGEGAV 


10 


HIV 




POL 


1495 


0.0006 


1.0470 


SMVGNWAKVL 


10 


HCV 




BNVl 


364 


0.0006 


1.0570 


KLIGTDNSW 


10 


HBV 


adr 


POL 


1317 


0.0006 


1,0924 


ILLWVLGW 


10 


C-ERB2 






661 


0.0006 


1.0397 


LLDTGADDTV 


10 


HIV 




POL 


619 


0.0005 


1.0446 


HLKTAVQMAV 


10 


HIV 




POL 


1426 


0.0005 


1.0604 


DLLMGTLGIV 


10 


HPV 


16 


E7 


81 


0.0005 


1.0443 


LLKLAGRWPV 


10 


HIV 




POL 


1356 


0.0004 


1.0461 


DLMVTVYYGV 


10 


HIV 




ENV 


2181 


0.0004 


1.0619 


TLEKLTNTGL 


10 


HPV 


18 


S6 


89 


0.0004 


1.0787 


SL7TILLSSDL 


10 


HBV 


adw 


POL . 


1017 


0.0004 


1.0521 


NLEDPA5REL 


10 


HBV ' 


-adr 


CORE 


465 


0.0003 


1.0583 


GLSAMSTTDL 


10 


HBV 


adr 


•X' 


1517 


0.0003 


1.0652 


VLVASRGRAV 


10 


PSA 






36 


0.0003 


1.0716 


DLSVFQNLQV 


10 


C-ERB2 






421 


0.0003 
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Seqruexicei-^ 






• • : S-train 


V Molecule . 


•U^os - ■* 


■ .:A2,X.: 


1,0723 


QLFRNPHQAL 


10 


C-ERB2 






484 


0.0003 


1.0727 


PLTSIISAW 


10 


C-ERB2 






650 


0 . 0003 


1.0479 


YLKGSSGGPL 


10 


HCV 




LORF 


1160 


0 . 0002 


1.0497 


QLPCEPEPDV 


10 


HCV 




LORF 


2159 


0.0002 


1.0523 


CLTFGRETVL 


10 


HBV 


adr 


CORE 


497 


0.0002 


1.0603 


TLEDLIiMGTL 


10 


HPV 


16 


E7 


78 


0.0002 


1.0631 


SLHCKPEEAL 


10 


MAGE 


1 




7 


0,0002 


1.0680 


EMFREIiNEAL 


10 


p53 






339 


0.0002 


1.0689 


VLKDAIKDLV 


10 


EBKAl 






574 


0.9002 


1.0757 


DLVDAEEYI.V 


10 


C-ERB2 






1016 


0.0002 


1.0796 


RMRGTFVSPL 


10 


HBV 


adw 


POL 


1317 


0.0002 


1.0669 


QLAKTCPVQL 


10 


P53 






136 


0.0001 


1.0717 


NLQVIRGRIL 


10 


C-ERB2 






427 


0.0001 


1.0721 


WLGIiRSLREL 


10 


C-ERB2 






452 


0.0001 


1.0522 


NMGLKIRQLL 


10 


HBV 


adr 


CORE 


482 


0 


1.0527 


PLSYQHFRKL 


10 


HBV 


adr 


POL 


576 


0 


1.0529 


ELPRLADEGL 


10 


HBV 


adr 


POL 


598 


0 


1.0531 


GLNRRVAEDL 


10 


HBV 


adr 


POL 


606 


0 


1.0536 


PLTVNEKRRXi 


10 


HBV 


adr 


POL 


672 


0 


1.0539 


IMPARFYPNL 


10 


HBV 


adr 


POL 


684 


0 


1.0550 


PLHPAAMPHL 


10 


HBV 


adr 


POL 


1012 


0 


1.0552 


DLHDSCSRNl. 


10 


HBV 


adr 


POL 


1051 


0 


1.0555 


LLYKTFGRICL 


10 


HBV 


adr 


POL 


1066 


0 


1.0557 


PIKTVCLSPFL 


10 


HBV 


adr 


POL 


1090 


0 
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f-S-train 


Molecule: . 


• 'Pos.. ; 


.::i:A2::.i;. 


1.0560 


VLGAKSVQHL 


10 


HBV 


adr 


POL 


1128 


0 


1.0569 


PLPIHTAELL 


10 




adr 


POL 


1296 


0 


1.0579 ' 


PLPSLAFSAV 


10 


HB\ 


adr 


•X' 


1454 


0 


1.0585 


DLEAYFKDCL 


10 


HBV 


adr 


•X' 


1525 


0 


1.05B7 


ELGEEIRLKV 


10 


HBV 


adr 


'X' 


1540 


0 


1.0589 


VLGGCRHKLV 


10 


HBV 


adr 


•X' 


1551 


0 


1.0597 


TLEQQYNKPL 


10 


HPV 


16 


E6 


94 


0 


1.0608 


DliCTELNTSL 


10 


HPV 


18 


E6 


16 


0 


1.0616 


RLQRRRETQV 


10 


HPV 


16 


E6 


49 


0 


1.0621 


HLEPQNEIPV 


10 


HPV 


18 


E7 


14 


0 


1.0639 


LUCmAREPV 


10 


MAGE 


1/3 




114 


0 


1.0643 


CLGLSYD6LL 


10 


MAGE 


1/3 




174 


0 


1.0657 


DUSLLKNRFIi 


10 


PSA 






98 


0 


1.0658 


T.T.PT.SEPAEL 


10 


PSA 






119 


0 


1.0663 


PLSQETFSDL 


10 


pS3 






13 


0 


1.0664 


PLPSQAMDDL 


10 


p53 






34 


0 


1.0690 


BLAALCRWGL 


10 


C-ERB2 






2 


0 


1.0692 


RLPASPETHL 


10 


C-ERB2 






34 


0 


1.0699 


RLRIVRGTQL 


10 


C-ERB2 






98 


0 


1.0701 


GLRELQLRSL 


10 


C-ERB2 






136 


0 


1.0730 


QMRILKETEL 


10 


C-ERB2 






711 


0 


1.0732 


ILKETELRKV 


10 


C-ERB2 






714 


0 


1.0754 


PLDSTFYRSL 


10 


C-ERB2 






999 


0 


1.0755 


LLEDDDMGDL 


10 


C-EPB2 






1008 


0 
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Peptide.. = 


. Seguencev.: 




Vlriisj;,; -; . 




^.Molecule' 


: Pos . ... 


:: A2,l.: 


1.0758 


DLGMGAAKGL 


10 


C-ERE2 






1089 


9 


1.0761 


PLPSETDGYV 


10 


C-ERB2 






1119 


0 


1.0763 


TLSPGKNGW 


10 


C-ERB2 






1172 


0 


1.0765 


TLQDPRVRAL 


10 


HBV 


adw 


ENV 


119 


0. 


1 .0768 


NMENIASGIuL 


10 


HBV 


adw 


BNV 


163 


0 


1 .0775 


ELPHLADEGL 


10 


HBV 


adw 


POL 


627 


0 


1.0776 


GLNRPVAEDL 


10 


HBV 


adw 


POL 


635 


0 


1.0777 


PLTVNENRRL 


10 


HBV 


adw 


POL 


701 


0 


1.0790 


LLYKTYGRKL 


10 


HBV 


adw 


POL 


1095 


0 


1.0801 


GLSAMSPTDL 


10 


HBV 


adw 


"X" 


1546 


0 


1.0802 


DLEAYFKDCV 


10 


HBV 


adw 


"X" 


1554 


0 


1.0803 


TLQDPRVRGL 


10 


HBV 


ayw 


ENV 


119 


0 


1-0804 


NMENITSGFL 


10 


HBV 


ayw 


BNV 


163 


' 0 


1.0891 


DLVNLLPAIL 


10 


HCV 




LORF 


1878 


0 


1.0404 


PLTEEKIKAL 


10 


HIV 




POL 


720 


<0 .0002 


1.0409 


QLGIPHPAGL 


10 


HIV 




POL 


786 


<0 .0002 


1.0411 


GLKKKKSVTV 


10 


HIV 




POL 


794 


<0.0002 


1.0450 


PIWKGPAiCLL 


10 


HIV 




POL 


1468 


<0.0002 


1.0476 


DLAVAVEPW 


10 


HCV 




LORF 


966 


<0 .0002 


1.0478 


SLTGRDKNQV 


10 


HCV 




LORF 


1046 


<0.0002 


1.0490 


DLEWTSTWV 


10 


HCV 




• LORF 


1652 


<0.0002 


1.0494 


GliGKVLIDIL 


10 


HCV 




LORF 


1843 


<0.0002 


1.0505 


VLTTSCGNTL 


10 


HCV 




LORF 


2704 


<0.0002 


1.0506 


ELITSCSSNV 


10 


HCV 




LORF 


2781 


<0.0002 
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i :; SequBnce 






viiStrain;-.: 


i :':-Mole'ciiie: 






1.0510 


CLRKLGVPPL 


10 






LORF 


2908 


<0.0002 


1.0511 


PU5FFF2HQL 


10 




adr 


ENV 


10 


<0 .0002 


1.0514 


NMENTTSGFL 


10 


HBV 


adr 


ENV 


163 


<0.0002 
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Appe 
PLE 


> 8-n 


lers 




Source 


Peocide 


AA 


1 


2 


3 


4 


5 


6 


7 


8 


Algorithm 
Score 
(EG2) 


Hu PLP 


10 


8 


C 


L 


V 


G 


A 


P 


F 


A 




Hu PLP 


13 


8 


G 


A 


P 


F 


A 


S 


L 


V 




Hu PLP 


23 


8 


G 


L 


c 


F 


F 


G 


V 


A 




Hu PLP 


39 


8 


A 


L 


T 


G 


T 


E 


K 


L 




Hu PLP 


40 


8 


L 


T 


G 


T 


E 


K 


L 


I - 




Hu PLP 


60 


8 


y 


L 


I 


N 


V 


I 


H 


A 




Ms PLP 


64 


8 


V 


I 


H 


A 


F 


Q 


C 


V 




Hu PLP 


64 


8 


V 


I 


H 


A 


F 


Q 


Y 


V 




Hu PLP 


74 


8 


G 


T 


A 


S 


F 


F 


F 


L 




Hu PLP 


80 


8 


F 


L 


Y 


G 


A 


L 


L 


L 




Hu PLP 


93 


8 


T 


T 


G 


A 


V 


R 


Q 


I 




Hu PLP 


106 


8 


T 


T 


I 


C 


G 


K 


G 


L 




Hu PLP 


131 


8 


Q 


A 


H 


S 


L 


E 


R 


V 




Hu PLP 


152 


8 


F 


V 


G 


I 


T 


Y 


A 


L 




Hu PLP 


154 


8 


G 


I 


T 


Y 


A 


L 


T 


V 




Hu PLP 


155 


8 


I 


T 


Y 


A 


L 


T 


V 


V 




Hu PLP 


157 


8 


Y 


A 


L 


T 


V 


V 


W 


L 




Hu PLP 


158 


8 


A 


L 


T 


V 


V 


W 


L 


L 




Hu PLP 


159 


8 


L 


T 


V 


V. 


w 


L 


L 


V 




Hu PLP 


164 


8 


L 


L 


V 


F 


A 


C 


S 


A 




Hu PLP 


165 


8 


L 


V 


F 


A 


C 


S 


A 


V 




Hu PLP 


167 


8 


F 


A 


C 


S 


A 


V 


P 


V 




Hu PLP 


199 


8 


S 


L 


c 


A 


D 


A 


R 


M 




Hu PLP 


203 


8 


D 


A 


R 


M 


Y 


G 


V 


L 




Hu PLP 


212 


8 


W 


. I 


A 


F 


P 


G 


K 


V 




Hu PLP 


218 


8 


K 


V 


C 


G 


S 


N 


L 


L 




Hu PLP 


224 


8 


L 


L 


s 


I 




K 


T 


A 




Hu PLP 


234 


8 


Q 


M 


T 


F 


H 


L 


F 


I 




Hu PLP 


238 


8 


H 


L 


F 


I 


A 


A 


F 


V 




Hu PLP 


244 


8 


F 


V 


G 


A 


A 


A 


T 


L 




Hu PLP 


247 


8 


A 


A 


A 


T 


L 


V 


S 


L 
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Appendi: 
PLP 8-1 


K II! 
ners 






Source 


Peptide 


AA 




2 


3 


4 


5 


6 


7 


8 


Algorithm 
Score 
(E02) 


Hu PLP 


248 


8 


A 


A 


T 


L 


V 


S 


L 


L 




Hu PLP 


253 


8 


S 


L 


L 


T 


F 


M 


I 


A 






254 


8 


L 


L 


T 


F 


M 


I 


A 


A 






260 


8 


A 


A 


T 


Y 


N 


F 


A 


V 




1 HU PL? 


261 


8 ' 


A 


T 


Y 


N 


F 


A 


V 


L 
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App 
MB 


sndi, 
P 8-1 


X II 
Tiers 


I 








Source 


Pepride 


AA 


1 


2 


3 


4 


5 


6 


7 


8 


Algorithm 
Score 
(E02) 


Hu MBP 


14 


8 


y 


L 


A 


T 


A 


S 


T 


M 




Hu MBP 


34 


8 


D 


T 


G 


I 


L 


D 


S 


I 




Hu MBP 


€5 


8 


R 


T 


A 


H 


Y 


G 


s 


L 




Ms MBP 


70 


8 


H 


A 


R 


S 


R 


P 


G 


L 




Hu MBP 


79 


8 


R 


T 


Q 


D 


E 


N 


P 


V 




Hu MBP 


86 


8 


V 


V 


H 


F 


F 


K 


N 


I 




Ms MBP 


87 


8 


R 


T 


T 


H 


Y 


G 


S 


L 




Hu MBP 


143 


8 


G 


V 


D 


A 


Q 


G 


T 


L 




Hu MBP 


149 


8 


T 


L 


S 


K 


. I 


F 


K 


L 
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Appendix 2 
PLP 9-me2 


:ii 

"S 




Source 


Peptide 


AA 




: 2 


3 


4 


5 


6 


7 


8 


9 


Algorithm 
Score 
(E02) 


Hu PLP 


163 


9 


W 


L 


L 


V 


F 


A 


C 


S 


A 


-18.67 


Hu PLP 


205 


9 


R 


M 


Y 


G 


V 


L 


P 


w 


I 


-18.79 


PLP 


145 


9 


W 


L 


G 


H 


P 


D 


K 


F 


V 


-19.05 


ria PLP 


253 


9 


5 


L 


L 


T 


F 


M 


I 


A 


A 


-19 .07 


Hu PLP 


251 


9 


L 


V 


S 


L 


L 


T 


F 


M 


I 


-20.03 


Hu PLP 


256 


9 


M 


I 


A 


A 


T 


Y 


N 


F 


A 


-20.32 


Hu PLP 


80 


9 


F 


L 


Y 


G 


A 


L 


L 


L 


A 


-20.53 


MS PLP 


205 


9 


R 


M 


Y 


G 


V 


L 


P 


w 


N 


-20.69 


Hu PLP 


64 


9 


V 


I 


H 


A 


F 


Q 


Y 


V 


I 


-20.71 


Hu PLP 


23 


9 


G 


L 


C 


F 


F 


G 


V 


A 


L 


-21.23 


MS PLP 


23 


9 


G 


L 


C 


F 


F 


G 


V 


A 


L 


-21.23 


Ms PLP 


179 


9 


N 


T 


W 


T 


T 


C 


Q 


S 


I 


-21.24 


Hu PLP 


233 


9 


F 


Q 


M 


T 


F 


H 


L 


F 


I 


-21.25 


Hu PLP 


234 


9 


Q 


M 


T 


F 


H 


L 


F 


I 


A 


-21.29 


Hu PLP 


259 


9 


I 


A 


A 


T 


Y 


N 


F 


A 


V 


-21.32 


Hu PLP 


157 


9 


Y 


A 


L 


T 


V 


V 


W 


L 


L 


-21.51 


Hu PLP 


76 


9 


A 


S 


F 


F 


F 


L 


Y 


G 


A 


-21.52 


Hu PLP 


158 


9 


A 


L 


T 


V 


V 


W 


L 


L 


V 


-21.56 


Hu PLP 


252 


9 


V 


S 


L 


L 


T 


F 


M 


I 


A 


-21.58 


Hu PLP 


237 


9 


F 


H 


L 


F 


I 


A 


A 


F 


V ' 


-21.61 


Ms PLP 


208 


9 


G 


V 


L 


P 


W 


N 


A 


F 


P 


-21.61 


Hu PLP 


164 


9 


L 


L 


V 


F 


A 


C 


S 


A 


V 


-21.81 


Hu PLP 


78 


9 


F 


F 


F 


L 


Y 


G 


A 


L 


L 


-22.05 


Hu PLP 


250 


9 


T 


L 


V 


S 


L 


L 


T 


F 


M 


-22.10 


Hu PLP 


208 


9 


G 


V 


L 


P 


W 


I 


A 


F 


P 


-22.10 


Hu PLP 


39 


9 


A 


L 


T 


G 


T 


E 


K 


L 


I 


-22.13 


Hu PLP 


240 


9 


F 


I 


A 


A 


F 


V 


G 


A 


A 


-22.19 


Hu PLP 


235 


9 


M 


T 


F 


H 


L 


F 


I 


A 


A 


-22.22 


Hu PLP 


244 


9 


F 


V 


G 


A 


A 


A 


T 


L 


V 


-22.22 


Ms PLP 


64 


9 


V 


I 


H 


A 


F 


Q 


C 


V 


I 


-22.33 
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Append 
PLP 9 


ix III 
-mers 


Source 


Pepcide 


AA 


1 


2 


3 


4 


5 


6 


7 


8 


9 


Algorithm 
Score 
(E02) 


Hu PLP 12 


9 


V G 


A 


P 


F 


A 


S L 


V 1 -22,36 1 


Hu PLP 


45 


9 


K 


L 


I 


E 


T 


Y 


F 


S 


K 


. -22.42 


Hu PLP 30 


9 


A 1 L 


F 


C 


G 


C 


G 


H 


E 


-22 .46 


Hu PLP 


9 


9 


R 


C 


L 


V 


G 


A 


P 


F 


A 


-22.52 


Hu PLP 


189 


9 


F 1 P 


S 


K 


T 


S 


A 


s . 


I 


-22 .54 


Hu PLP 


71 


9 


V 


I 


Y 


G 


T 


A 


S 


F 


F 


-22.60 


Hu PLP 


73 


9 


Y 


G 


T 


A 


S 


F 


F 


F 


L 


-22 .63 


Hu PLP 


11 


9 


L 


V 


G 


A 


P 


F 


A 


S 


L 


-22.64 


Hu PLP 


86 


9 


L 


L 


A 


E 


G 


F 


Y 


T 


T 


-22.65 


MS PLP 


63 


9 


N 


V 


I 


H 


A 


F 


Q 


C 


V 


-22.65 


Hu PLP 


212 


9 


W 


I 


A 


F 


P 


G 


K 


V 


C 


-22.67 


Hu PLP 


223 


9 


N 


L 


L 


S 


I 


C 


K 


T 


A 


-22.68 


Hu PLP 


199 


9 


S 


L 


C 


A 


D 


A 


R 


M 


Y 


-22 .71 


Hu PLP 


179 


9 




T 


W 


T 


T 


C 


D 


S 


I 


-22.73 


Hu PLP 


201 


9 


C 


A 


D 


A 


R 


M 


Y 


G 


V 


-22.74 


Hu PLP 


112 


9 


G 


L 


S 


A 


T 


V 


T 


G 


G 


-22.78 


Hu PLP 


161 


9 


V 


V 


W 


L 


L 


V 


F 


A 


C 


-22.78 


Hu PLP 


175 


9 


Y 


I 


Y 


F 


N 


T 


W 


T 


T 


-22. 8l" 
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Appendix III 
PLP 9-merB 


Source 


Peptide 


AA 


1 


2 


3 


4 


5 


6 


7 


8 


9 


Algorithm 
. Score 
(E02) 


Hu PLP 


56 


9 


Q 


D 


Y 


E 


Y 


L 


I 


N 


V 


-22.84 


Hu PLP 


241 


9 


I 


A 


A 


F 


V 


G 


A 


A 


A 


-22.87 


Hu PLP 


154 


9 


G 


I 


T 


Y 


A 


L 


T 


V 


V 


-22.89 


Hu PLP 


257 


9 


F 


M 


I 


A 


A 


T 


Y 


N 


F 


-22.89 


Hu PLP 


196 


9 


S 


I 


G 


S 


L 


C 


A 


D 


A 


-22.90 


Hu PLP 


18 


9 


S 


L 


V 


A 


T 


G 


L 


C 


F 


-22.91 


Hu PLP 


261 


9 


A 


T 


Y 


N 


F 


A 


V 


L 


K 


-23.00 


Hu PLP 


171 


9 


A 


V 


P 


V 


Y 


I 


Y 


F 


N 


-23 .05 


Hu PLP 


70 


9 


Y 


V 


I 


Y 


G 


T 


A 


S 


F 


-23.11 


Hu PLP 


22 


9 


T 


G 


L 


C 


F 


F 


G 


V 


A 


-23 .12 


Hu PLP 


134 


9 


S 


L 


E 


R 


V 


C 


H 


C 


L 


-23.16 


Hu PLP 


16 


9 


F 


A 


S 


L 


V 


A 


T 


G 


L 


-23.20 


Hu PLP 


74 


9 


G 


T 


A 


S 


F 


F 


F 


L 


Y 


-23.20 


Hu PLP 


79 


9 


F 


F 


L 


Y 


G 


A 


L 


L 


L 


-23.24 


Hu PLP 


246 


9 


G 


A 


A 


A 


T 


L 


V 


S 


L 


-23.26 


Hu PLP 


181 


9 


W 


T 


T 


C 


D 


S 


I 


A 


F 


-23.27 


Hu PLP 


28 


9 


G 


V 


A 


L 


F 


c 


G 


C 


G 


-23.31 


Hu PLP 


247 


9 


A 


A 


A 


T 


L 


V 


S 


L 


L 


-23.31 


Hu PLP 


219 


9 


V 


C 


G 


S 


N 


L 


L 


S 


I 


-23.33 


Hu PLP 


160 


9 


T 


V 


V 


W 


L 


L 


V 


F 


A 


-23.40 


Hu PLP 


54 


9 


N 


Y 


Q 


D 


Y 


E 


Y 


L 


I 


-23.43 


Hu PLP 


107 


9 


T 


I 


c 


G 


K 


G 


L 


S 


A 


-23.45 


Hu PLP 


166 


9 


V 


F 


A 


C 


S 


A 


V 


P 


V 


-23.53 


Hu PLP 


2 


9 


G 


L 


L 


E 


C 


C 


A 


R 


C 


-23.57 


Hu PLP 


167 


9 


F 


A 


C 


S 


A 


V 


P 


V 


Y 


•23.60 


Hu PLP 


260 


9 


A 


A 


T 


Y 


N 


F 


A 


V 


L 


-23.61 


Hu PLP 


152 


9 


F 


V 


G 


I 


T 


Y 


A 


L 


T 


-23.63 


Hu PLP 


187 


9 


1 


A 


F 


P 


S 


K 


T 


S 


A 


-23.64 


Hu PLP 


63 


9 


N 


V 


I 


H 


A 


F 


Q 


Y 


V 


-23.65 


Hu PLP 


60 


9 


Y 


L 


I 


N 


V 


I 


H 


A 


F 


-23.66 


Hu PLP 


85 


9 


L 


L 


L 


A 


E 


G 


F 


Y 


T 


-23.66 
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Appendix III 
PLP 9-mers 


Source 


Peptide 


AA 


1 


2 


3 


4 


5 


6 


7 


8 


9 


Algorithm 
Score 
(E02) 


Ms PLP 


210 


9 


L 


P 


W 


N 


A 


F 


P 


G 


K 


-23.66 


Hu PLP 


198 


9 


G 


S 


L 


C 


A 


D 


A 


R 


M 


-23.67 


Hu PLP 


20 


9 


V 


A 


T 


G 


L 


C 


F 


F 


G 


-23 .71 


Hu PLP 


263 


9 


Y 


N 


F 


A 


V 


L 


K 


L 


M 


-23.71 


Ms PLP 


209 


9 


V 


L 


P 


W 


N 


A 


F 


P 


G 


-23.71 


Hu PLP 


84 


9 


A 


L 


L 


L 


A 


E 


G 


F 


Y 


-23.73 


Hu PLP 


206 


9 


M 


Y 


G 


V 


L 


P 


W 


I 


A 


-23 .77 


Hu PLP 


153 


9 


V 


G 


I 


T 


Y 


A 


L 


T 


V 


-23. 80 


Hu PLP 


269 


9 


K 


L 


M 


G 


R 


G 


T 


K 


F 


-23.92 


Hu PLP 


138 


9 


V 


C 


H 


C 


L 


G 


K 


W 


L 


-23.99 


Hu PLP 


3 


9 


L 


L 


E 


C 


C 


A 


R 


C 


L 


-24.02 


Hu PLP 


92 


9 


y 


T 


T 


G 


A 


V 


R 


Q 


I 


-24 .40 


Hu PLP 


21 


9 


A 


T 


G 


L 


C 


F 


F 


G 


V 


-24.47 


Hu PLP 


192 


9 


K 


T 


S 


A 


S 


I 


G 


S 


L 


-24.74 


Hu PLP 


38 


9 


E 


A 


L 


T 


G 


T 


E 


K 


L 


-25.72 


Hu PLP 


105 


9 


K 


T 


T 


I 


C 


G 


K 


G 


L 


-26.97 
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Appendix I 
MBP 9-mer 


II 
■s 


Source 


Peptide 


AA 


1 


2 


3 


4 


5 


6 


7 


8 


9 


Algorithm 

Score 
• (E02) 


Hu MBP 


110 


9 


S 


L 


S 


R 


F 


S 


W 


G 


A 


-21.42 


Hu MBP 


14 


9 


Y 


L 


A 


T 


A 


S 


T 


M 


D 


-22.01 


Ms MBP 


59 


9 


W 


L 


K 


Q 


S 


R 


S 


P 


L 


-22.60 


Hu MBP 


86 


9 


V 


V 


H 


F 


F 


K 


N 


I 


V 


-22.80 


Ms MBP 


52 


9 


R 


G 


S 


G 


K 


V 


P 


W 


L 


-22.87 


Hu MBP 


16 


9 


A 


T 


A 


S 


T 


M 


D 


^ H 


A 


-23.11 


Hu MBP 


37 


9 


I 


L 


D 


S 


I 


G 


R 


F 


F 


-23.11 


Hu MBP 


108 


9 


G 


L 


S 


L 


5 


R 


F 


S 


W 


-23.34 


Hu MBP 


93 


9 


I 


V 


T 


P 


R 


T 


P 


P 


P 


-23.41 


Ms MBP 


63 


9 


S 


R 


S 


P 


L 


P 


S 


H 


A 


-23.47 


Hu MBP 


79 


9 


R 


T 


Q 


D 


E 


N 


P 


V 


V 


-23.49 


Hu MBP 


129 


9 


G 


R 


A 


S 


D 


Y 


K 


S 


A 


-23.53 


Hu MBP 


21 


9 


M 


D 


H 


A 


R 


H 


G 


F 


L 


-23.60 


Hu MBP 


160 


9 


n 


S 


R 


S 


G 


S 


P 


M 


A 


-23.63 


Ms MBP 


75 


9 


P 


G 


L 


C 


H 


M 


Y 


K 


D 


-23.64 


Hu MBP 


112 


9 


s 


R 


F 


s 


W 


G 


A 


E 


G 


-23.77 


Hu MBP 


162 


9 


R 


S 


G 


s 


P 


M 


A 


R 


R 


-23,77 


Hu MBP 


159 


9 


R 


D 


S 


R 


S 


G 


S 


P 


M 


-23.81 


Hu MBP 


85 


9 


P 


V 


V 


H 


F 


F 


K 


N 


I 


-23.82 


Hu MBP 


136 


9 


S 


A 


H 


K 


G 


F 


K 


G 


V 


-23.90 


Hu MBP 


149 


9 


T 


L 


S 


K 


I 


F 


K 


L 


G 


-23.90 


Ms MBP 


162 


9 


K 


G 


F 


K 


G 


A 


Y 


D 


A 


-23.92 


Hu MBP 


64 


9 


A 


R 


T 


A 


H 


Y 


G 


S 


L 


-23.99 


Ms MBP 


166 


9 


G 


A 


Y 


D 


A 


Q 


G 


T 


L 


-24.66 


Hu MBP 


148 


9 


G 


T 


L 


S 


K 


I 


F 


K 


L 


-24.78 


Hu MBP 


145 


9 


D 


A 


Q 


G 


T 


L 


S 


K 


I 


-25.25 
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Appendix III 
PLP 10-mers 


Source 


Peocide 


AA 


1 


2 


3 


4 


5 


6 


7 


8 


9 


1 U 


Al cro2ri timi 
Score 

( £0^ i 


MS PLP 1 178 


10 


F 


N 


T 


W 


T 


T C 


Q 


S 


I 


-24 . 68 


Hu PLP 


178 


10 


F 


N 


T 


W 


T 


T 1 C 


D I 


S 


I 


-25 . 14 


Hu PLP 204 


10 


A 


R 


M 


Y 


G 


V L 


P 


w 


I 


-25 .48 


Hu PLP 


163 


10 


W 


L 


L 


V 


F 


A 1 C 


S 


A 


V 


-25 . 66 


Hu PLP 218 


10 


K 


V 


C 


G 


S 


N L 


L 


S 


I 


-25.89 


Hu PLP 


250 


10 


T 


L 


V 


S 


L 


L T 


F 


M 


I 


-26.00 


Hu PLP 


19 


10 


L 


V 


A 


T 


G 


L C 


F 


F 


G 


-26 . 25 


Hu PLP 


78 


10 


F 


F 


F 


L 


Y 


G A 


L 


L 


L 


-26.68 


Hu PLP 


157 


10 


Y 


A 


L 


T 


V 


V 


W 


L 


L 


V 


-26 .72 


Hu PLP 1 84 


lb 


A 


L 


L 


L 


A 


E 1 G 


F 


Y 


T 


-26 .77 


Hu PLP 


233 


10 


F 


Q 


M 


T 


F 


H 


L 


F 


I 


A 


-26 .78 


Hu PLP 


80 


10 


F 


L 


Y 


G 


A 


L 


L 


L 


A 


E 


-26.79 


Hu PLP 


167 


10 


F 


A 


C 


S 


A 


V 


P 


V 


Y 


I 


-27 .28 


Hu PLP 


165 


10 


L 


V 


F 


A 


C 


S 


A 


V 


P 


V 


-27 .32 


Hu PLP 


4 


10 


L 


E 


C 


C 


A 


R 


C 


L 


V 


G 


-27 .36 


Hu PLP 


253 


10 


S 


L 


L 


T 


F 


M 


I 


A 


A 


T 


-27 :42 


Hu PLP 


135 


10 


L 


E 


R 


V 


C 


H 


C 


L 


G 


K 


-27'. 48 


Hu PLP 


176 


10 


I 


Y 


F 


N 


T 


W 


T 


T 


C 


D 


-27 . 62 


Hu PLP 


24 


10 


L 


C 


F 


F 


G 


V 


A 


L 


F 


C 


- 27 . 74 


Hu PLP 


146 


10 


L 


G 


H 


P 


D 


K 


F 


V 


G 


I 


-27.88 


Hu PLP 


237 


10 


F 


H 


L 


F 


I 


A 


A 


F 


V 


G 


-27 .95 


Hu PLP 


56 


10 


Q 


D 


Y 


E 


Y 


L 


I 


N 


V 


I 


-27 .99 


MS PLP 


204 


10 


A 


R 


M 


Y 


G 


V 


L 


P 


W 


N 


-28.01 


Hu PLP 


158 


10 


A 


L 


T 


V 


V 


W 


L 


L 


V 


F 


-28 .04 


Hu PLP 


137 


10 


R 


V 


C 


H 




L 


G 


tr 

K. 


ij 
w 


T. 

Lt 




Hu PLP 


72 


10 


I 


Y 


G 


T 


A 


S 


F 


F 


F 


L 


-28.16 


Hu PLP 


63 


10 


N 


V 


I 


H 


A 


F 


Q 


Y 


V 


I 


-28.17 


Hu PLP 


208 


10 


G 


V 


L 


P 


W 


I 


A 


F 


P 


G 


-28.17 


Hu PLP 


27 


10 


F 


G 


V 


A 


L 


F 


C 


G 


C 


G 


-28,29 


Hu PLP 


85 


10 


L 


L 


L 


A 


E 


G 


F 


Y 


T 


T 


-28.32 


Ms PLP 


62 


10 


I 


N 


V 


I 


H 


A 


F 


Q 


C 


V 


-28.33 
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Appendix III 
PLP 10-mers 


Source 


Peptide 


AA 


1 


1 

I 


4 




6 




8 


9 


10 


Algorithm 
Score 
(202 ) 


Hu PLP t 222 


X u 


c 


N 


- 


T. 


c 


I 






T 






Hu PLP 


76 


1 o 




S 


F 


r 


r 


T, 
Xi 


V 

X 


\3 


>i 


T 
ij 


- 28 .43 


PLP 


208 


X u 


rz 
\j 


V 


L 


tr 


W 


Li 




1? 

r 


r* 


r» 


- 2 8 .45 


1 \P 


207 


X u 


I 


G 


V 


T 
Xj 


tr 


W 
n 


X 




rt 
r 


r 


- 28 .46 


Hu PLP 


79 


X u 




F 


L 


V 
X 






T. 
Xj 


T. 
IJ 


T. 
Xj 


>i 


i; 

^ O AC 

-28.45? 


Hu PLP 


236 


X u 




r 


n 


T 

Xj 


r 


T 
X 


>* 


n 
A 


r 


V 


-23,50 


Hu PLP 


240 


X u 


IT 


I 


A 


7\ 

>i 


17 

r 


V 




TV 

>i 




A 


- 28 .51 


Hu PLP 


181 


10 


W 


T 


T 


c 


D 


s 


I 


A 


F 


P 


-28.56 


Hu PLP 


224 


10 


L 


L 


S 


I 


C 


K 


T 


A 


E 


F 


-28.56 


Hu PLP 


10 


10 


C 


L 


V 


G 


A 


P 


F 


A 


S 


L 


-28.62 


Hu PLP 


152 


10 


F 


V 


G 


I 


T 


Y 


A 


L 


T 


V 


-28.64 


Hu PLP 


62 


10 


I 


N 


V 


I 


H 


A 


F 


Q 


Y 


V 


•28.64 


:^u PLP 


214 


10 


A 


F 


P 


G 


K 


V 


C 


G 


s 


N 


•28.65 


Hu PLP 


188 


10 


A 


F 


P 


S 


K 


T 


s 


A 


S 


I 


-28.65 


Hu PLP 


99 


10 


Q 


I 


.F 


G 


D 


Y 


K 


T 


T 


I 


-28.69 


Hu PLP 


18 


10 


S 


L 


V 


A 


T 


G 


L 


C 


F 


F 


-28.73 


Hu PLP 


3 


10 


L 


L 


E 


C 


C 


A 


R 


C 


L 


V 


-28.75 
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Appendix III 
PLP lO-mers 


Source 


Peocide 


AA 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 
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17 


10 


A 


S 


L 


V 


A 


T 


G 


L 


C 


F 


-28 .76 


Hu PLP 


144 


10 


K 


w 


L 


G 


H 


P 


D 


K 


F 


V • 


-28 .78 


MS PLP 


181 


10 


w 


T 


T 


C 


Q 


S 


I 


A 


F 


P 


-28 .78 


Hu PtiP 


159 


10 


L 


T 


V 


V 


w 


L 


L 


V 


F 


A 


-28.79 


Hu PLP 


174 


10 


V 


Y 


I ■ 


Y 


F 


N 


T 


W 


T 


T 


-28.80 


Hu PLP 


248 


10 


A 


A 


T 


L 


V 


S 


L 


L 


T 


F 


-28.84 


Hu PLP 


23 


10 


G 


L 


C 


F 


F 


G 


V 


A 


L 


F 


-28.87 


Hu PLP 


209 


10 


V 


L 


P 


W 


I 


A 


F 


P 


G 


K 


-28.87 


Hu PLP 


29 


10 


V 


A 


L 


F 


C 


G 


C 


G 


H 


E 


-28.90 


Hu PLP 


261 


10 


A 


T 


Y 


N 


F 


A 


V 


L 


K 


L 


-28.92 


Ms PLP 


63 


10 


N 


V 


I 


H 


A 


F 


Q 


C 


V 


I 


-28.93 


Hu PLP 


74 


10 


G 


T 


A 


S 


F 


F 


F 


L 


Y 


G 


-28:93 


Hu PLP 


259 


10 


I 


A 


A 


T 


Y 


N 


F 


A 


V 


L 


-29,06 


Hu PLP 


242 


10 


A 


A 


F 


V 


G 


A 


A 


A 


T 


L 


-29 .24 


Hu PLP 


2 


10 


G 


L 


L 


E 


C 


C 


A 


R 


C 


L 


-29; 30 


Hu PLP 


257 


10 


F 


M 


I 


A 


A 


T 


Y 


N 


F 


A 


-29:37 


Hu PLP 


20 


10 


V 


A 


T 


G 


L 


C 


F 


F 


G 


V 


-29 ;41 


Ms PLP 


205 


10 


R 


M 


Y 


G 


V 


L 


P 


W 


N 


A 


-29*43 


Hu PLP 
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10 


I ■ 


T 


Y 


A 


L 


T 


V 


V 


W 


L 


-29 .60 


Hu PLP 


30 


10 


A 


L 


F 


C 


G 


C 


G 


H 


E 


A 


-29 .70 


Hu PLP 


205 


10 


R 


M 


Y 


G 


V 


L 


P 


W 


I 


A 


-29 .74 


Hu PLP 


258 


10 


M 


I 


A 


A 


T 


Y 


N 


F 


A 


V 


-30.06 


Hu PLP 


234 


10 


Q 


M 


T 


F 


H 


L 


F 


I 


A 


A 


-30.29 


Hu PLP 


238 


10 


H 


L 


F 


I 


A 


A 


F 


V 


G 


A 


-30.64 


Hu PLP 


246 


10 


G 


A 


A 


A 


T 


L 


V 


s 


L 


L 


-30.64 


Hu PLP 


38 


10 


E 


A 


L 


T 


G 


T 


E 


K 


L 


I 


-30.92 


Hu PLP 


230 


10 


T 


A 


E 


F 


Q 


M 


T 


F 


H 


L 


-31.03 


Hu PLP 


11 


10 


L 


V 


G 


A 


P 


F 


A 


S 


L 


V 


-31-25 


Hu PLP 


201 


10 


C 


A 


b 


A 


R 


M 


Y 


G 


V 


L 


-31.73 
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Append! 
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Source 


Peptide 


AA 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 
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Score 
(E02) 


Hu PLP 


2 


11 


G 


L 


L 


E 


C 


C 


A 


R 


C 


L 


V 




Hu PLP 


10 


11 


C 


L 


V 


G 


A 


P 


F 


A 


s 


L 


V 




Hu PLP 


19 


11 


L 


V 


A 


T 


G 


L 


C 


F 


F 


G 


V 




Hu PLP 


21 


11 


A 


T 


G 


L 


C 


F 


F 


G 


V 


A 


L 




Hu PLP 


30 


11 


A 


L 


F 


C 


G 


C 


G 


H 


E 


A 


L 




Hu PLP 


SX 


11 


L 


I 


N 


V 


I 


H 


A 


F 


Q 


V 


V 




! PLP 


61 


11 


L 


I 


N 


V 


I 


H 


A 


F 


Q 


C 


V 




Hu PLP 


71 


11 


V 


I 


Y 


G 


T 


A 


S 


F 


F 


■ F 


L 




Hu PLP 


75 


11 


T 


A 


S 


F 


F 


F 


L 


Y 


G 


A 


L 




Hu PLP 


86 


11 


L 


L 


A 


E 


G 


F 


Y 


T 


T 


G 


A 




Hu PLP 


87 


11 


L 


A 


E 


G 


F 


Y 


T 


T 


G 


A 


V 




Hu PLP 


107 


11 


T 


I 


C 


G 


K 


G 


L 


S 


A 


T 


V 




i Hu PLP 


145 


11 


W 


L 


G 


H 


P 


D 


K 


F 


V 


G 


I 




Hu PLP 
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11 


F 


V 


G 


I 


T 


Y 


A 


L 


T 


V 
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G 


I 


T 
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A 


L 


T 


V 


V 


W 


L 
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I 


T 


Y 


A 


L 


T 


V 


V 


W 


L 
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11 


A 


L 


T 


V 


V 


W 


L 


L 


V 


F 


A 
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11 


L 


L 


V 
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A 
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S 


A 


V 


P 


V 
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11 


I 


A 


F 


P 


S 


K 


T 


S 


A 


S 


I 




Hu Pi.r 
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11 


S 


L 


C 


A 


D 


A 


R 


M 


Y 


G 


V 




Hu PLP 


203 


11 


D 


A 


R 


M 


Y 


G 


V 


L 


P 


W 


I 




Hu PLP 


209 


11 


V 


L 


P 


W 


I 


A 


F 


P 


G 


K 


V 




Ms PLP 


209 


11 


V 


L 


P 


W 


N 


A 


F 


P 


G 


K 


V 




Hu PLP 
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11 


K 


T 


A 


E 


F 


Q 


M 


T 


F 


H 


L 




Hu PLP 
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11 


H 


T 


F 


H 


L 


F 


I 


A 


A 


F 


V 
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11 


H 


L 


F 


I 


A 


A 


F 


V 


G 


A 
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A 


A 


F 


V 


G 


A 


A 


A 


T 


L 




Hu PLP 
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A 


A 


F 


V 


G 


A 


A 


A 


T 


L 


V 




Hu PLP 
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11 


F 


V 


G 


A 


A 


A 


T 


L 


V 


S 


L 




Hu PLP 
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A 


T 


L 


V 


S 


L 


L 


T 


F 


M 
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F 
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28 


10 


F 


L 
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R 
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D 


T 


G 
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10 


A 


Y 


D 


A 


Q 


G 


T 


L 


S 


K 


-28.54 


Hu MBP 


89 


10 


F 


F 
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N 


I 


V 


T 


p 


R 


T 
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Hu MBP 


14 


10 


Y 


L 


A 


T 


A 


s 


T 


M 


D 


H 


-28.75 


Hu MBP 


84 


10 


N 


P 


V 


V 


H 


F 


F 


K 


N 


I 


-28 .80 
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32 


10 


H 


R 


D 


T 


G 


I 


L 


D 


S 


I 
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Hu MBP 
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S 


L 


S 


R 


F 


S 


W 


G 


A 
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85 


10 


P 


V 


V 


H 


F 


F 


K 


N 


I 


V 


-30.82 
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85 


10 


H 


T 


R 


T 


T 


H 


Y 


G 


S 


L 


-31.29 


Hu MBP 


20 


10 


T 


M 


D 


H 


A 


R 


H 


G 


F 


L 


-31.40 


Hu MBP 


63 


10 


P 


A 


R 


T 


A 


H 


Y 


G 


S 


L 


-31.76 


MS MBP 


48 


10 


G 


A 


P 


K 


R 


G 


S 


G 


K 
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WHAT IS CLAIMED IS : 

1. A composition comprising an immunogenic peptide 
having an HLA-A2.1 t.:r.ding motif, which ' immunogenic peptide 

5 has 9 residues, and the following residues: 

a first conserved residue at the second position 
from the N- terminus selected from the group consisting of I, 
V, A and T; 

a second conserved residue at the C- terminal 
"-0 position selected from the group consisting of V, L, I, a and 
M. 

2. A composition comprising an immunogenic peotide 
having an HLA-A2.1 binding motif, which immunogenic peptide 

15 has 9 residues: 

a first conserved residue at the second position 
from the N- terminus selected from the group consisting of L, 
M, I, V, A and T; 

a second conserved residue at the C- terminal 
20 position selected from the group consisting of A and M; 

3. The composition of claim l, wherein the amino 
acid at position 1 is not an amino acid selected from the 
group consisting of D, and P. 

25 

4. The composition of claim 2, wherein the amino 
z at position 1 is not an amino acid selected from the 

.....up consisting of D, and P. 

30 5. The composition of claim 1, wherein the amino 

acid at position 3 from the N- terminus is not an amino acid 
selected from the group consisting of D, E, R, K and H. 

6. The composition of claim 2, wherein the amino 
35 acid at position 3 from the N- terminus is not an amino acid 
selected from the group consisting of D, E, R, K and H 



10 
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7. The composition of claim 1, wherein the amino 
acid at position 6 from the N- terminus is not an amino acid 
selected from the group consisting of R, K and H. 

8. The composition of claim 2, wherein the amino 
acid at position 6 from the N- terminus is not an amino acid 
selected from the group consisting of R, K and H. 

9. The composition of claim l, wherein the amino 
acid at position 7 from the N- terminus is not an amino acid 
selected from the group consisting of R, K, H, D and E. 



10. The composition of claim 2, wherein the amino 
acid at position 7 from the N- terminus is not an amino acid 

15 selected from the group consisting of R, K, D and E. 

11. A composition comprising an immunogenic peptide 
having an HIiA-A2.1 binding motif, which immunogenic peptide 
has about 10 residues: 

20 a first conserved residue at the second position 

from the N- terminus selected from the group consisting of L, 
M, I, V, A, and T; and 

a second conserved residue at the C- terminal 
position selected from the group consisting of V, I, L, A and 

25 M; 

wherein the first and second conserved residues are 
separated by 7 residues . 

12. The composition of claim 11,. wherein the amino 
30 acid at position 1 is not an amino acid selected from the 

group consisting of D, E ajad P. 



35 



13. The composition of claim 11, wherein the amino 
acid at position 3 from the N- terminus is not an amino acid 
selected from the group consisting of D and E. 
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14. The composition of claim 11, wherein the amino 
acid at position 4 from the N- terminus is not an amino acid 
selected from the group consisting of A, K, R and H. 

15. The composition of claim 11, wherein the amino 
acid at positon 5 from the N- terminus is not P. 

16. The composition of claim 11, wherein the amino 
acid at position 7 from the N- terminus is not an amino acid 
selected from the group consisting of R, K and H. 

17. The composition of claim 11, wherein the amino 
acid at position 8 from the N- terminus is not an amino acid- 
selected from the group consisting of D, E, R, K and H. 

18. The composition of claim 11, wherein the amino 
acid at position 9 from the N- terminus is not an amino acid 
selected from the group consisting of R, K and H. 

19. A pharmaceutical composition comprising a 
pahramceutically acceptable carrier and a therapeutically 
effective amount of a peptide capable of binding an HLA-A2.1 
moelcule and inducing an immune response in a mammal. 

20. The pharmaceutical composition of claim 19, 
wherein the peptide has a formula as follows: TLGIVCPI. 

21. The pharamceutical composition of claim 19, 
further comprising a peptide having a formula as fellows: 
YMLDLQPETT . 

22. The pharmaceutical composition of claim 19, 
further comprising a T helper peptide. 

23. The pharmaceutical composition of claim 22, 
wherein the T helper peptide has a formula as follows: 
cLKXVAAWTLKAAa, wherein a is D- alanine and X is 
cyclohexylalanine . 
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HLA-A PURIFICATION AND 
PEPTIDE ELUTION 



CELLULAR SOURCE= HLA-ANTIGENS 
(5-IOxl09 CELL EQUIVALENTS) 



DETERGENT LYSIS 
(lOB CELLS/ml) 



DETERGENT LYSATE 



♦ 

AFFINITY CHROMATOGRAPHY 



f 

PURIFIED HLA-A ANTIGEN 



ACID TREATMENT 



A) EBV TRANSFORMED B CELL 
LINES - HOMOZYGOUS 

B) HLA-A TRANSFECTANTS - 
e.g. .221-HLA-AI 

C) P8I5 TRANSFECTANTS 
(MOUSE MASTOCYTOMA) 

1% NP-40 OR 1% RENEX 30 PLUS 
PROTEASE INHIBITORS - I HR, 4»C 



CENTRIFUGATION AT 15,000 xg, 
30 MIN. 



mAb-SEPHAROSE 5 mg/ml 
5-10 ml COLUMN 



ANTICIPATED YIELDS 450-900 M9 



10% ACETIC ACID, 5 MIN, lOO'C 



PEPTIDES 



YM3 FILTRATION, 3kD CUT-OFF 



SEQUENCE/MOTIF 



D. HUNT - HPLC/EI-TMS 
CYTEL- HPLC/ABI 477A 



F!6, L 



SUBSTITUTE SHEET (RULE 26) 
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SCATTERGRAM FOR COLUMNS' XI Yl r2 = .578 



S 0' 



2 -2 
S -3 
-4 
-5 



A2.I LOG ADJ BND 

FIG. 4, 



' ■ 

©^o cP oo o 
S — fluu ^ o — 


o I- 
o ; 










@ 00 o 




4.5 -4 -3.5 -3 -2.5 -2 -li -1 -.5 0 


.5 



O LOG(X) OF ADJ. 



CO 



SCATTERGRAM FOR COLUMNS: XIYI r2-.7I8 




-4.5 -4 -3.5 -3 



-2.5 -2 -1.5 
LOG OF BINDING 

FIG, 5. 



-.5 



O GOI SUN 



.5 I 



SUBSTITUTE SHEET (RULE 26) 



INTE31NATIONA1. SEARCH REPORT 



li lauonai appucauon Nc 
PCT/US94/02353 



A. CLASSinCATION OF SUBJECT MATTER 

IPaS) :A61K 37/02; C07K 7/06 
US CL :424/88; 530/328 
According to intemationai Patent Classification iIPC^ or to both national classification and IPC 



B. 



RELDS SEARCHED 



Minimum documenution searched (classification system toUowed by ciassiiicauon symoolsi 
U.S. : 424/88: 530/328, 868; 514/885 



Documentation searched other than mmimum documentation to the extent that such documents arc included in the fields searched 



Electronic data base consulted during the intemauonal search (name of data base and. where pracucable. search terms used) 
APS. Medline, CAS Registry 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



X 
Y 



Journal of Immunolgy, Volume 147, No. 11, issued 01 
December 1991, Sette et aU "Random Association Between 
the Peptide Repertiore of A2.1 Class I and Several Different 
DR Class 11 Molecules", pages 3893, see page 3897-3900, 
Table III. 

Science, Volume 255, issued 06 March 1992, Henderson et 
al, "HLA- A2.1 Associated Peptides from a Mutant Cell Line: 
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